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Preface 


We dedicate this book to the memory of J. Frank Adams. His clear 
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Hyland, P.T. Johnstone, A. Joyal, A. Kock, F.W. Lawvere, G.E. Reyes, 
R. Solovay, R. Swan, R.W. Thomason, M. Tierney, and G.C. Wraith. 
Our presentation combines ideas and results from these people and from 
many others, but we have not endeavored to specify the various original 
sources. Moreover, a number of people have assisted in our work by pro- 
viding helpful comments on portions of the manuscript. In this respect, 
we extend our hearty thanks in particular to P. Corazza, K. Edwards, 
J. Greenlees, G. Janelidze, G. Lewis, and S. Schanuel. 

Our work on the book has been supported by the Netherlands Sci- 
ence Foundation (NWO) and by the Department of Mathematics at the 
University of Chicago. Earlier related work was enthusiastically encour- 
aged by Frank Adams during a visit by one of us (S.M.) to Cambridge 
University in 1972. Moreover, lectures by $.M. on the occasion of several 
visits to the University of Heidelberg were encouraged by A. Dold and D. 
Puppe. We gratefully note that, from the beginning of our joint project 
in May of 1988, Peter May has made effective arrangements for many 
visits by I.M. to Chicago. We are likewise grateful to M.C. Pedicchio, 
who arranged for our joint visit in 1990 to The University of Trieste, 
where we wrote Chapter VIII. 

Our special thanks go to Walter Carlip, who typed up the entire 
manuscript with verve and understanding, and to Springer-Verlag for 
the smooth production of the resulting book. 


Saunders Mac Lane, Ieke Moerdijk 
Chicago and Utrecht, June 1991 
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Prologue 


A startling aspect of topos theory is that it unifies two seemingly 
wholly distinct mathematical subjects: on the one hand, topology and 
algebraic geometry, and on the other hand, logic and set theory. Indeed, 
a topos can be considered both as a “generalized space” and as a “gen- 
eralized universe of sets”. These different aspects arose independently 
around 1963: with A. Grothendieck in his reformulation of sheaf theory 
for algebraic geometry, with F. W. Lawvere in his search for an axioma- 
tization of the category of sets and that of “variable” sets, and with Paul 
Cohen in the use of forcing to construct new models of Zermelo—Frenkel 
set theory. 

The study of cohomology for generalized spaces led Grothendieck 
to define his notion of a topos. The cohomology was to be one with 
variable coefficients—for example, varying under the action of the fun- 
damental group, as in N. E. Steenrod’s work in algebraic topology, or, 
more generally, varying in a sheaf. The notion of a sheaf has its origins 
in the analytic continuation of functions, as initiated in the 19th cen- 
tury and then formulated rigorously in H. Weyl’s famous book on the 
“idea” of the Riemann surface. For several complex variables the study 
of domains of holomorphy and of the Cousin problems gradually led 
H. Cartan and K. Oka in the 1940’s to study ideals on a domain. They 
were in effect sheaves; thus Cartan in 1944 spoke of “coherent systems 
of punctual ideals”, while in 1949 Oka discussed “ideals with indeter- 
minate domain”. Then shortly after World War II, J. Leray published 
the first general and explicit definition of a sheaf on a space, described 
in terms of the closed sets of that space. H. Cartan, building on the 
ideas of Leray, rephrased the definition of sheaves in terms of open sets 
in his seminars of 1948-49 and 1950-51; in the course of these seminars, 
Lazard introduced the equivalent definition of a sheaf on a space X as 
an étale map into X. The subtle equivalence between these two notions 
is a central motivation of topos theory. 

Roughly speaking, a sheaf A of abelian groups on a topological space 
X is a family of abelian groups A,, parametrized by the points x € 
X in a suitably “continuous” way. This means in particular that the 
disjoint union ]] Az of all these groups is a space, so topologized that 
the projection of this space into X (sending each group A, to the point 
z) is continuous and also étale, in the sense that the topology on the 
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disjoint union is “horizontal” to match the topology of X, while the 
algebra (the abelian group structure of the various A,) is vertical, along 
each fiber A,. For each open set U C X one can then consider the 
“sections” s over U of the sheaf A: each section is a function which 
selects—again in a suitably continuous way—for each point « € U an 
element s(x) in the corresponding abelian group A,. Thus, given a 
smaller open set V C U, each section s over U can be restricted to the 
smaller V. And conversely, the whole section s over U can be recovered 
by collating the restrictions of s to each of the smaller open sets V; in 
some covering U = |JV;. For example, two sections s over V and s’ 
over V’ yield a new section sUs’ over VUV’, provided s coincides with 
s’ on the overlap VV’. Then the sheaf A can be described wholly 
in terms of all these sections s for all open sets U, together with these 
operations of restriction and collation. With this development of the 
notion of sheaf it became possible to define a corresponding cohomology 
of a topological space with sheaf coefficients. 


Then J. P. Serre and others realized that such sheaves could be used 
not only in topology but also in algebraic geometry, and that the con- 
struction of a sheaf on a space X could proceed from sections s defined 
on objects U which were not necessarily subsets U C X, but simply 
mappings U — X from some other space U into X. Thus, ideas from 
category theory entered, even though the tradition of Bourbaki pro- 
scribed the use of such terms. They led Grothendieck to define sheaves 
in a general context, replacing the partially ordered collection of open 
subsets of a space by objects of a category C in which suitable families 
of maps U; — X (for i € I) form “covers” of objects X in C. Then for 
such a “Grothendieck topology” a sheaf became something—indeed be- 
came a functor-—which could be suitably collated over each such cover. 
With this general notion of sheaf, various cohomologies could be for- 
mulated in a long range attack on the Weil conjectures about solutions 
of polynomial equations. In the early sixties, these remarkably general 
ideas were rapidly developed by A. Grothendieck and his collaborators— 
J. L. Verdier, M. Artin, M. Giraud, M. Hakim, L. Illusie, and others. 
The results, initially recorded in a semi-secret document, SGA IV, later 
appeared as an expanded “SGA IV” in three volumes of the Springer 
Lecture Notes, for a total of 1623 pages. They were widely influential on 
the whole structure of algebraic geometry and in particular eventually 
led to the solution of the Weyl conjectures by P. Deligne in 1974. 


For Grothendieck, topology became the study of (the cohomology 
of) sheaves, and the sheaves “sited” on a given Grothendieck topology 
formed a topos—subsequently called a Grothendieck topos. Since the 
very notion of sheaf is thus central to topos theory, Chapter II will de- 
velop properties of sheaves on a topological space, so as to introduce 
the intuition of sheaves, both in terms of fibers pasted together and in 
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terms of sections which can be restricted and collated. The equivalence 
between these two notions will be discussed, as well as the way in which 
a continuous map between spaces leads to a geometric morphism—a 
suitable pair of adjoint functors between the sheaf categories. Chap- 
ter III introduces the more general notion of coverings in a category (a 
Grothendieck topology), the resulting “sites”, as well as the topos formed 
as the category of all the sheaves of sets on such a site. Furthermore, 
it will be explained how any functor on a site can be transformed, in 
two steps, into a sheaf; this process of “sheafification” provides another 
basic example of a pair of adjoint functors. 


Thus, categories (from 1945) and adjoint functors between them, as 
revealed by D. M. Kan in 1957, form a language indispensable for the 
organization and understanding of our subject. Our categorical prelim- 
inaries (before Chapter I) may serve to remind the reader of such indis- 
pensable notions as “pullback”, “adjoint functor”, etc., while Chapter I 
will inter alia introduce a ubiquitous adjunction which will later provide 
a remarkable wealth of tensor products (Chapter VII). 


Categories, initially a convenient way of formulating exact sequences, 
diagram chasing, and axiomatic homology for topology, acquired inde- 
pendent life in the work of Ehresmann and his students in France, and 
in the United States in the work of Kan and Mac Lane, and in a group 
around Eilenberg at Columbia, which included in particular Barr, Freyd, 
Gray, Lawvere, Linton, and Tierney. Then in 1963 Lawvere embarked 
on the daring project of a purely categorical foundation of all mathe- 
matics, beginning with an appropriate axiomatization of the category 
of sets, thus replacing set membership by the composition of functions. 
When Lawvere heard of the properties of Grothendieck topoi, he soon 
observed that such a topos admits basic operations of set theory such 
as the formation of sets Y* of functions (all functions from X to Y) 
and of power sets P(X) (all subsets of X). At about the same time 
M. Tierney saw that Grothendieck’s work could lead to an axiomatic 
study of sheaves. Subsequently, Lawvere and Tierney, working together 
at Dalhousie University, discovered an effective axiomatization of cat- 
egories of sheaves of sets (and, in particular, of the category of sets) 
via an appropriate formulation of set-theoretic properties. Thus they 
defined in an elementary way, free of all set-theoretic assumptions, the 
notion of an “elementary topos”. The early definition underwent several 
changes and modifications to yield a final axiomatization of a beautiful 
and amazing simplicity: an elementary topos is a category with finite 
limits, function objects Y* (defined as adjoints) for any two objects X 
and Y, and a power object P(X) for each object X; these are required to 
satisfy some simple basic axioms, much like the first-order properties of 
ordinary function sets and power sets in naive set theory. Chapter I will 
begin our exposition by exhibiting the construction of these function 
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objects and power objects in several concrete examples of elementary 
topoi—notably in categories of set-valued functors (presheaves). 

Every Grothendieck topos is an elementary topos, but not con- 
versely: the axiomatization by Lawvere and Tierney is both elementary 
(first-order logic, with no reference to set theory) and more inclusive 
than Grothendieck’s. Nonetheless, many of the basic properties of sets 
and functions, and of sheaf categories, can be developed on the basis 
of the Lawvere-Tierney axioms, as shown in Chapter IV. Furthermore, 
Grothendieck’s definition of topology in terms of coverings can be refor- 
mulated for any elementary topos in terms of “coverings” of subobjects, 
giving rise to a theory of sheaves and sheafification relative to a topos, 
as described in detail (among other things) in Chapter V. 

Lawvere’s basic idea, as noted above, was that a topos is a “uni- 
verse of sets”. In Chapter VI, we will take up this idea and compare 
it with some of the developments in set theory. Our first example is 
a topos-theoretic presentation of Cohen’s work on the independence of 
the Continuum Hypothesis. The Continuum Hypothesis goes back to 
G. Cantor and can be formulated thus: any infinite subset B C R of 
the real line has either the same cardinality as the real line itself, or is 
denumerable (i.e., has the cardinality of the set N of natural numbers). 
Gédel had already shown in 1938 that the Continuum Hypothesis does 
not contradict the usual (Zermelo—Freenkel) axioms of set theory, but 
for a long time it was unclear whether or not the Continuum Hypoth- 
esis follows from these Zermelo—Freenkel axioms. In 1963, Paul Cohen 
showed that this was not the case; his method was what is now called 
“Cohen forcing”. 

Since the cardinality of the set R of reals is the same as that of the 
powerset P(N) of the set of natural numbers, Cohen’s problem can be 
phrased as follows: find a set B and injective functions 


N-——— B-——> P(N) (1) 


such that there exists no surjection N —» B, and no surjection B—» P(N); 
thus the cardinality of the set B lies strictly between the cardinality of 
N and that of P(N). To do this, Cohen considered a “universe” of sets 
(a model of set theory) and then expanded this universe by “forcing” 
an altogether new set B of subsets of N into this expanded universe, 
so that in this new universe the cardinality of B is strictly between the 
cardinalities of N and P(N). This technique of expanding the “universe” 
by forcing was later rephrased by R. Solovay and D. S. Scott in terms of 
Boolean-valued models, where the truth predicate takes values not just 
“true” and “false”, but all values in an arbitrary Boolean algebra. 
Shortly after this, Lawvere and Tierney made the remarkable dis- 
covery that Cohen’s forcing technique could be explained in terms of 
topoi: indeed, using exactly Cohen’s constructions, one obtains a topos 
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(a category of sheaves), in which there exists a sheaf B which lies strictly 
between the sheaf of natural numbers N and its power sheaf P(N), as 
in (1). Chapter VI will extensively discuss this sheaf-theoretic version 
of Cohen’s proof. 

Cohen also used similar methods to show that the Axiom of Choice 
cannot be derived from the usual Zermelo—Freenkel axioms of set theory. 
More recently, P. Freyd gave an elegant and noticeably simpler sheaf- 
theoretic proof of this fact. Freyd’s construction will be presented in 
Chapter VI. 

Around the same time as Cohen, but evidently independently, the 
logician S. Kripke discovered semantical interpretations, first of modal 
logic and shortly after of intuitionistic logic, which bore a striking sim- 
ilarity to some aspects of Cohen’s forcing technique. Sheaf theory also 
explains the relation between Cohen’s forcing and Kripke’s models for 
intuitionistic logic. 

Intuitionistic logic, and the mathematics based on it, originated with 
Brouwer’s work on the foundations of mathematics, at the beginning of 
this century. He defined real numbers by choice sequences and insisted 
that all proofs be constructive. This meant that he did not allow proof 
by contradiction and hence that he excluded the classical tertium non 
datur (for all p, either p, or not p). His approach was not formal or ax- 
iomatic, but subsequently Heyting and others introduced formal systems 
of intuitionistic logic, weaker than classical logic. This may suggest that 
intuitionistic mathematics is a proper part of ordinary mathematics, but 
this is not so: for example, in intuitionistic mathematics a suitable de- 
scription of real numbers R leads to the result that all functions R — R. 
are continuous, as was already shown by Brouwer. 

In a topological space the complement of an open set U is closed but 
not usually open, so among the open sets the “negation” of U should be 
the interior of its complement. This has the consequence that the double 
negation of U is not necessarily equal to U. Thus, as observed first by 
Stone and Tarski, the algebra of open sets is not Boolean, but instead 
follows the rules of the intuitionistic propositional calculus. Since these 
rules were first formulated explicitly by A. Heyting, such an algebra is 
called a Heyting algebra. The “truth values” of any topos constitute 
such a Heyting algebra. The basic properties of these Heyting algebras 
are formulated in Chapter I. 

From this point of view it is not surprising that subobjects in a 
category of sheaves have a negation operator which belongs to a Heyting 
algebra. Moreover, there are quantifier operations on sheaves, defined 
by adjunction, which have exactly the properties of the corresponding 
quantifiers in intuitionistic logic. This leads to the remarkable result, 
foreshadowed by the observation of Stone and Tarski as well as by Scott’s 
topological models, that the “intrinsic” logic of a topos is in general 
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intuitionistic. However, there can be particular sheaf categories, such as 
those constructed by Cohen and by Freyd, where the intuitionistic logic 
becomes ordinary (classical) logic. 

Kripke’s semantics for intuitionistic logic can also be viewed as a de- 
scription of truth for the language of a suitable topos. And as a further 
illustration of the way in which topos theory incorporates Brouwer’s 
ideas we will present at the end of Chapter VI a particular topos to- 
gether with its real numbers R in which all functions R. — R are indeed 
continuous. 

Together with the notion of a topos, there is the notion of a map— 
or a “geometric morphism”-—between two topoi, defined as a pair of 
adjoint functors having certain additional exactness properties. 

A more familiar example of a pair of adjoint functors as a map comes 
from ring theory. If R and S are commutative rings, consider a left 
R- and right S-module M. For each left R-module B, the module M 
then yields by “homming” a left S-module Homr(M,B). In the other 
direction, each left S-module A yields by tensor product a left R-module 
M @g A. The “tensor product” functor 


Mz — : (S —- Mod) — (R —- Mod) 
between module categories is actually a left adjoint to the Hom-functor 
Homp,(M, —-): (R — Mod) — (5 — Mod) 
because of the familiar isomorphism (left adjoint on the left) 
Hompr(M @¢ A, B) = Homs(A, Homr(M, B)) 


between the corresponding Hom-sets. Moreover, when M is flat as an 
S-module (for example, when the ring S is a field) the functor M @g — 
preserves kernels and hence exact sequences, so is an “exact” functor. 
For topoi, the definition of geometric morphisms is modeled on the 
case of a continuous map between topological spaces. Indeed, such a 
map f: Y — X induces operations in both directions on sheaves. Thus, 
if we regard a sheaf A on the codomain space X as a family A, of sets 
parametrized by the points x of X, then f induces a family As,,) of sets 
parametrized by the points y € Y. The resulting sheaf on Y is called 
the “inverse image” of A and denoted by f*A. On the other hand, a 
sheaf B on Y, regarded as a family of sections s over open sets V of Y, 
yields a new family of sections over open sets U of X by composition 
with f: the sections over such a U are exactly the composites so f where 
s is a section of B over the open set f-1(U) € Y. These new sections 
over open sets in X form a sheaf f,.B over the space X called the direct 
image of B. These two operations of inverse and direct image constitute 
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a pair of adjoint functors, for which the inverse image f* is left adjoint 
to the direct image f,. Moreover, the left adjoint f* is (left) exact, in 
the sense that it preserves finite limits. 

By definition, a “geometric morphism” f: 7 — € between any two 
topoi € and F is such a pair of adjoint functors 


fri EF, fe: FOE, 


where the left adjoint is required to be left exact. Such geometric mor- 
phisms arise not only from continuous maps between topological spaces, 
but also in many seemingly quite different contexts, as will be demon- 
strated in Chapter VII. This chapter will also show that any geometric 
morphism between (Grothendieck) topoi can be viewed as a Hom-tensor 
adjunction, very similar to the familiar such adjunction for modules as 
just described. For topoi, exactness of the left adjoint (the tensor prod- 
uct) will again be analyzed in terms of a notion of “flatness” . 

In topology, continuous maps lead naturally to the construction of 
classifying spaces. For example, there is a classifying space B for (com- 
plex) vector bundles. This means that there is a standard (“universal” ) 
vector bundle E over B such that, for any space X, maps from X into 
B correspond via £ to vector bundles on X: the standard bundle E 
over B “pulls back” along any map X — B to produce a bundle over 
X, and every vector bundle over X is such a pullback of the standard 
bundle E. There is a similar “classifying space” for cohomology: For 
any integer n > 0 and any abelian group 7 the Eilenberg—Mac Lane 
space K (7,7) classifies cohomology, in the sense that for any space X 
(homotopy classes of ) maps from X into K(,n) correspond to elements 
in the (singular) cohomology group H"(X, 7). 

In a similar way, many sorts of mathematical structures can be “clas- 
sified” by a suitable topos. For example, since a topos has products of 
objects, one can readily describe ring-objects in a topos. There is a spe- 
cial topos R, with a “universal” ring-object R in R, which is a classifying 
topos for ring-objects in topoi. This means that geometric morphisms 
f: € > R® correspond exactly to ring-objects S in €: the inverse image 
of such a morphism will carry the universal ring R in R to a ring f*(R) 
in €, and any ring-object S in € is of the form f*(R) for a suitable 
geometric morphism f. 

As an introduction to the properties of “classifying topoi”, we will 
present this example of the classifying topos R in Chapter VIII. In the 
discussion of this and other examples, the construction of the required 
geometric morphisms makes use of the general Hom-tensor adjunction of 
Chapter VII. This adjunction makes another appearance in Chapter X, 
which provides a general existence theorem for classifying topoi. It is 
shown that for any mathematical structure, which can be described by 
“geometric” axioms in a suitable language, there is a classifying topos. 
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The proof makes use of models of the language in various topoi, and 
relates to earlier uses (in Chapter VI) of formal languages in the context 
of topoi. 

A topos is, in a suitable sense, a generalized space, so should have 
(generalized!) points. Indeed, at a given point x of an ordinary topo- 
logical space X, one can erect each set A as a sort of “skyscraper” sheaf 
A, on X concentrated around the point x. The resulting mapping from 
the category of sets into that of sheaves on X is, in fact, the direct im- 
age of a geometric morphism Sets — Sh(X). But an arbitrary topos 
E may not have enough “points” Sets — € in this sense. In order to 
develop an adequate comparison between topoi and spaces, it is useful 
to alter the definition of a space by describing a space not in terms of its 
points, but in terms of its open sets. The objects so defined by a lattice 
of open sets are called locales. Since sheaves can be described in terms 
of coverings by open sets, one can construct a topos Sh(X) consisting 
of all the sheaves of sets on such a locale X. Moreover, any “continu- 
ous” map Y — X between locales gives rise to a geometric morphism 
Sh(Y) — Sh(X) between such sheaf topoi. 

These locales are introduced in Chapter IX. There we show that 
every (Grothendieck) topos € has an underlying locale Loc(€). More- 
over, every topos is a “quotient” of the sheaf topos for some locale. 
More explicitly, from any topos € one can construct, by a method of 
Diaconescu, first a locale X, then the topos Sh(X) of sheaves on that 
locale, and finally a geometric morphism Sh(X) — €. This morphism 
is both a surjection (like a map onto a space) and open (in a suitable 
sense). Thus, € is indeed a quotient of its “Diaconescu cover” X. 

Those topoi which are “finitely generated” in an appropriate sense 
are said to be coherent. Deligne’s theorem in Chapter IX states that each 
coherent topos € has “enough” points. More explicitly, the underlying 
locale Loc(E) is an ordinary topological space, and the Diaconescu cover 
of € can be replaced by an ordinary topological space X which “covers” 
E by way of a surjection from the topos Sh(X) of sheaves onto the 
coherent topos €. 
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At the end of the book, the reader will find an Appendix which 
discusses how various different sites can represent the same topos, and 
an Epilogue which provides some suggestions for further reading on the 
subject, beyond the “First Introduction” which this book is meant to 
provide. 

A reference to III.6.(11) is to equation (11) in section 6 of Chapter 
III, and similarly for theorems. 


Categorical Preliminaries 


Before embarking on the actual topic of this book, we wish to review 
briefly the basic notions that will be used from category theory. Many 
readers will be familiar with these preliminaries; they should immedi- 
ately start with Chapter I, referring back to these preliminaries whenever 
necessary. On the other hand, these preliminaries do not present suf- 
ficiently many examples and are by no means enough to constitute a 
proper introduction to category theory, and the reader who lacks suffi- 
cient categorical background is advised to first read some of the relevant 
parts of Mac Lane’s [CWM—Categories for the Working Mathe- 
matician, 1971] (or some other such text), perhaps using the following 
pages as a guideline. 

A category C consists of a collection of objects (often denoted by 
capital letters, A,B,C,...,X,...), a collection of morphisms (or maps 
or arrows) (f,g,-...), and four operations; two of these operations as- 
sociate with each morphism f of C its domain dom(f) or do(f) and 
its codomain cod(f) or di(f), respectively, both of which are objects of 


C. One writes f: C — D or C — D to indicate that f is a morphism 
of C with domain C and codomain D, and one says that f is a mor- 
phism from C to D. The other two operations are an operation which 
associates with each object C' of C a morphism 1g (or idc) of C called 
the identity morphism of C' and an operation of composition which as- 
sociates to any pair (f,g) of morphisms of C such that do(f) = di(g) 
another morphism f 0g, their composite. These operations are required 
to satisfy the following axioms: 


(i) do(lc) = C =di(1c), 
(ii) do(f°g) =do(g), di(fog)=di(f), 
(iii) lpof=f, folco=f, 

(iv) (fog)oh=fo(goh). 
In (ii)-{iv), we assume that the compositions make sense; thus, (ii) is 
required to hold for any pair of arrows f and g with do(f) = di(g), 
and (iii) is required to hold for any two objects C and D of C and any 
morphism f from C to D, etc. 
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For example, there is a category Sets whose objects are sets and 
whose morphisms are functions with the usual composition. Similarly, 
topological spaces and continuous maps between them form a category, 
as do groups and homomorphisms, or vector spaces (over R say) and 
linear maps. Any partially ordered set (P,<) gives rise to a category, 
with the elements of P as objects, and with precisely one morphism from 
p to q iff p < q; in other words, the morphisms are pairs (p, g) such that 
p <q, and the domain and codomain operations on a pair are given by 
the first and second projections. Thus, the composition operation for P 
is uniquely determined by the transitivity of the order relation <. We 
mention in particular the categories 0,1,2,... coming from the ordered 
sets 0, {0}, {0,1},... of natural numbers with their usual ordering. 

An example of a different nature is obtained from a group G. Such a 
group can be regarded as a category with only one object, call it *, and 
with the elements of the group G as morphisms, where the multiplication 
of the group is used as the composition operation of the corresponding 
category. 

In an arbitrary category C, a morphism f: C — D in C is called an 
isomorphism if there exists a morphism g: D — C such that fog=1p 
and go f = 1c. (This defines g uniquely, and g is called the inverse of 
f.) If such a morphism f exists, one says that C' is isomorphic to D, 
and one writes f: C->D and C & D. The example of a category coming 
from a group G, as mentioned above, shows that a group is the same 
thing as a category with only one object in which each morphism is an 
isomorphism. 

A morphism f: C — D is called an epi(morphism) if for any object 
E and any two parallel morphisms g,h: D3 E in C, gf = hf implies 
g = h; one writes f: C—» D to indicate that f is an epimorphism. 
Dually, f: C — D is called a mono(morphism) if for any object B and 
any two parallel morphisms g,h: B=3C in C, fg = fh implies g = h; 
in this case, one writes f: C— D. Two monomorphisms f: A»—> D and 
g: BD with a common codomain D are called equivalent if there 
exists an isomorphism h: AB with gh = f. A subobject of D is an 
equivalence class of monomorphisms into D. The collection Subc(D) of 
subobjects of D carries a natural partial order defined by [f] < [g] iff 
there is an h: A — B such that f = gh, where [f] and |g] are the classes 
of f: A> D and g: BD. 

For Sets (and other familiar categories) this definition matches the 
usual notion of subset (or subspace, etc.). 

If C is a category, we sometimes write Co for its collection of objects 
and C, for its collection of morphisms. For two objects C and D, the 
collection of morphisms with domain C' and codomain D is denoted by 
one of the following three symbols, 


Homc(C, D), Hom(C, D), C(C, D). 
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In general, we shall not be very explicit about set-theoretical founda- 
tions, and we shall tacitly assume we are working in some fixed universe 
U of sets. Members of U are then called small sets, whereas a collection 
of members of U which does not itself belong to U will sometimes be 
referred to as a large set. Given such an ambient universe U, a cate- 
gory C is locally small if for any two objects C and D of C the hom-set 
Homc(C, D) is a small set, while C is called small if both Cp and C, are 
small sets. Of the categories mentioned above, the categories of small 
sets, of small topological spaces, of small vector spaces, and of small 
groups are all locally small but not small. The categories coming from a 
small poset (P, <) or a small group G in the universe U are both small. 

Given a category C, one can form a new category C°P, called the 
opposite or dual category of C, by taking the same objects but reversing 
the direction of all the morphisms and the order of all compositions. In 
other words, an arrow C' — D in C°P is the same thing as an arrow 
D—-C in C (see [CWM,, p. 33]). 

Given a category C and an object C' of C, one can construct the 
comma category or the slice category C/C (read: C over C): objects 
of C/C are morphisms of C with codomain C’, and morphisms in C/C 
from one such object f: D — C to another g: E — C are commutative 
triangles in C 


ie., gh = f. (In [CWM, the notation C | C is used instead of C/C; 
cf. |CWM, p. 46].) The composition in C/C is defined from the com- 
position in C, in the obvious way (paste triangles side by side). 
Categories are compared by using functors. Given two categories 
C and D, a functor from C to D is an operation F which assigns to 
each object C of C an object F(C) of D, and to each morphism f of 
C a morphism F(f) of D, in such a way that F respects the domain 
and codomain as well as the identities and the composition: F(do(f)) = 
do(F(f)), F(di(f)) = di(F(f)), Fc) = 1ric), and also F(f 0 9) = 
F(f) 0° F(g), whenever this makes sense. One writes F: C — D or 


C zs D. For example, there is a functor from the category of topological 
spaces and continuous maps to the category of sets and functions, given 
by sending a space to the “underlying” set of its points. If C is an 
arbitrary category and C is an object of C, then the domain operation 
gives a functor 

F:C/C-C 
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sending an object f: D = C of C/C to Ff = do(f) and defined in the 
obvious way on morphisms. 

For a category C, there is an identity functor idc: C — C, and for 
two functors F: C — D and G: D — E, one can form a new functor 
GoF: CE by composition. (Ignoring set-theoretic difficulties, there 
is thus a “category of categories” .) 

Let F and G be two functors from a category C to a category D. 
A natural transformation a from F to G, written a: F — G, is an 
operation associating with each object C of C a morphism ag: FC > 
GC of D, in such a way that, for any morphism f: C’ > C in C, the 
diagram 


FOO" 


ro] |e 


commutes, ie., G(f) 0 ac = ago F(f). The morphism ac is called 
the component of a at C. If every component of a is an isomorphism, 
a is said to be a natural isomorphism. If a: F — G and 8: G — H are 
two natural transformations between functors C > D, one can define a 
composite natural transformation (0 a by setting 


(Boa)c = Barc) 9 ac. 


For fixed categories C and D, this yields a new category D©: the objects 
of D© are functors from C to D, while the morphisms of D© are natural 
transformations between such functors. Categories so constructed are 
called functor categories. 

For categories C and D, a functor F: C°P — D is also called a 
contravariant functor from C to D. In contrast, ordinary functors from 
C to D are sometimes called covariant. Thus, C’ + Homc(C’,C) 
for fixed C' yields a contravariant functor from C to Sets, while C > 
Homc(C’,C) for fixed C’ is the covariant Hom-functor. 

A functor F: C > D is called full (respectively faithful) if for any 
two objects C' and C’ of C, the operation 


Homc(C’,C) — Homp(FC’, FC); froF(f); 


induced by F is surjective (respectively injective). A functor F: C >= D 
is called an equivalence of categories if F is full and faithful, and if, 
moreover, any object of D is isomorphic to an object in the image of F. 
For example, if F: C — D is a functor such that there exists a functor 
G: D > C and natural isomorphisms a: FoG> idp and 8: GoFSidc, 
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then F is an equivalence (and G is sometimes called a quasi-inverse 
for F). Conversely, using a sufficiently strong axiom of choice, every 
equivalence F has a quasi-inverse; see [CWM, pp. 91-92]. 

Next, we recall several “universal” constructions. For example, in 
the category Sets of (small) sets and functions between them, there is 
the construction of the cartesian product A x B of two sets A and B. It 
comes together with two projections 7}: Ax B — Aand m2: AxB- B. 
Usually, A x B is constructed as the set of all pairs (a,b) with a € A 
and b € B. However, up to isomorphism, the product A x B can also 
be described purely in terms of objects and morphisms in the category 
of sets, as follows. We say that an object X equipped with morphisms 
™: X — A and m2: X — B is a product of A and B if for any other 
object Y and any two maps f: Y — A and g: Y — B there is a unique 
map h: Y — X such that 7,0h = f and m20h=g. [This unique map 
is then denoted by (f,g): Y — X or sometimes by ( f,g) with pointed 
brackets.] This definition makes sense in any category and determines 
the object X (if it exists) to within isomorphism. It is common to denote 
a product of two objects A and B in an arbitrary category, if it exists, 
by A x B. Iteration then yields products of three or more factors. In 
an arbitrary category, the product of two objects may or may not exist; 
for instance, in the category of topological spaces, the product of two 
spaces always exists, and it may be constructed as the cartesian product 
of the underlying sets, equipped with the familiar product topology. A 
product of an J-indexed family A; is written II;A,. For a poset (P, <), 
viewed as a category in the way explained above, the product of two 
objects p and q is their infimum (greatest lower bound), which may or 
may not exist. 

Other special constructions of sets may also be characterized purely 
in categorical terms, i.e., in terms of objects and morphisms. For ex- 
ample, the singleton set {*} is the set S, unique up to isomorphism, for 
which there is exactly one morphism A — S from any other set A into 
S. In an arbitrary category C, an object C with the property that for 
any other object D of C there is one and only one morphism from D to 
C is called a terminal object of C. If it exists, it is, like {x}, unique up 
to isomorphism; it is often denoted by 1 or by lc if necessary. 

A construction which plays a central role in this book is that of a 
pullback, or fibered product. Given two functions f: B > A and g: C > 
A between sets, one may construct their fibered product as the set 


BxaC={(b,c)EBxC| f(b) = g(c) }. 


Thus, B x 4C is a subset of the product, and, therefore, comes equipped 
with two projections m,: Bx4C — B and m2: Bx4C > C which fit 
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into a commutative diagram 


Bx,zC—=2 +C 


[he 


B——+ A; 


ie., fa, = gm. This diagram has the property that given any other set 

X and functions @: X — Bandy: X — C'such that ff = gy, there isa 

unique function 6: X — Bx, C with 76 = @ and 726 = ¥ [namely, the 

function 6(x) = (G(x), 7(x))|. This property determines the set B x 4 C 

up to isomorphism. One says that (1) is the “universal” commutative 

square on the data f and g. If A = {*}, B x4 C is the product B x C. 
In a general category C, one says that a commutative square 


i ° 


is a pullback (square), or a fibered product, if it has the property just 
described for sets: given any object X of C and morphisms @: X — B 
and 7: X — C' with f@ = gy, there is a unique 6: X — P such that 
pé = B and q6 =. [This unique map 6 


is usually denoted by (G,7).] Given f: B > A and g: C — A, the 
pullback P with its projections p and q is uniquely determined up to 
isomorphism (if it exists at all), and one usually writes B x 4 C for this 
pullback. If, for given f and g, the pullback (2) exists, one also says 
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that the arrow p is the pullback of g along f (and symmetrically that 
q is the pullback of f along g). Notice that p is a monomorphism if g 
is. One says that monomorphisms are preserved by pullback along an 
arbitrary morphism. Incidentally, the notion of monomorphism can be 
described in terms of pullbacks. A morphism f: B — A in a category C 
is a monomorphism iff the pullback of f along itself is an isomorphism, 
iff the square 


is a pullback. 
There is an important “pasting-lemma” for pullback squares 
({CWM,, p. 72]). Given a commutative diagram of the form 


Q +P + D 
| | | 
C +B >A 


in an arbitrary category C, the outer rectangle is a pullback if both inner 
squares are pullbacks; and conversely, if the outer rectangle as well as 
the right-hand square are pullbacks, then so is the left-hand square. 

Equalizers also deserve mention. For two parallel arrows f: A — B 
and g: A —> B in a category C, the equalizer of f and g is a morphism 
e: E > A such that fe = ge and which is universal with this property; 
that is, given any other morphism u: X — A in C such that fu = gu, 
there is a unique v: X — FE such that ev = u: 


Equalizers need not always exist. However, in the category of sets 
the equalizer of any pair of functions f,g: A= B exists, and can be 
constructed as the set 


E={aeéA| f(a) =9(a)} 


where e: E — A is set inclusion. 
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To each of the categorical notions of product, terminal object, pull- 
back, and equalizer, there is a corresponding dual notion, namely, that 
of a coproduct, an initial object, a pushout, and a coequalizer. For exam- 


ple, a diagram A ae XB in a category C is said to be a coproduct 
of A and B if the corresponding diagram A «< X — Bin C? isa 


product. In other words, A »Xe_ Bisa coproduct iff for any two 
morphisms f: A — Y and g: B = Y into another object Y, there is a 
unique morphism h: X — Y with hu = f and hv = g. The coproduct 
of A and B, if it exists, is unique up to isomorphism and is denoted by 


A+B or AIlB. The maps A — ALI B and B — AIIB are called the 
coproduct inclusions. For example, the disjoint sum (= disjoint union) 
defines a coproduct in the category of sets. A coproduct of a family A; 
for i € I is written as II; A;. 

Similarly, an object C of C is an initial object of C if it is a termi- 
nal object in C°P. An initial object of C, if it exists, is unique up to 
isomorphism and is usually denoted by 0. 


f 
Given morphisms A — B and A “5 Cin C, a diagram 


A—t-B 


| I 
Ca oP 
is called a pushout if the corresponding diagram in C°? is a pullback. 
The pushout of f and g, if it exists, is unique up to isomorphism and is 
denoted by P= BU, C. 
Finally, a morphism B -—+ C in C is said to be a coequalizer of 
a given parallel pair of morphisms f,g: A= B iff the corresponding 
diagram C' > B= A in C°P is an equalizer. 
We now come to the central notion of adjoint functor, discussed at 
length in Chapter IV of [CWM]. Consider two categories A and X and 
two functors between them in opposite directions, say 


F:X—OA G:A—X. (5) 


One says that G is right adjoint to F (and that F is left adjoint to G, 
notation: F 4 G) when for any two objects X from X and A from A 
there is a natural bijection between morphisms 
f 

X—-GA 

h ? 

FX—+A 
in the sense that each morphism f, as displayed, uniquely determines 
a morphism hk, and conversely. This bijection is to be natural in the 


(6) 
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following sense: given any morphisms a: A — A’ in A and £: X’ > 
X in X, and corresponding arrows f and h as in (6) the (inevitable) 


composites also correspond under the bijection (6): 


f Ga 
xox —+ GA — GA! 


FE h a (7) 
PX'—> FX -+A—->A! 
If we write this bijective correspondence as 
6: Homx(X, GA) —~—> Homa (FX, A), (8) 


then this naturality condition can be expressed by the equation 


9(G(a) o fo) = a0 Wf) 0 F(£). (9) 


Examples of adjoints abound in mathematics. The reader may find a 
list. of examples on pp. 85-86 of [CW™M, and will encounter numerous 
other examples in the course of this book. 

With an adjunction as above, there are associated certain so-called 
unit and counit morphisms. Given @ as in (8), and an object X in X, 
setting A = FX gives a unique map 


n=nx: X ~ GFX (10) 


such that 6(nx) = 1px). This map nx is called the unit of the ad- 
junction (at X). If one takes € = ly, A= FX, f=, a = h, and 
A’ = A in (7), the bottom composite is simply h: FX — A, and it 


corresponds to the top composite X pail GFX a GA. In short, 7 
determines the adjunction, since h corresponds to G(h) o 7x under the 
correspondence (6). This means that each f determines uniquely an h 
which makes the following triangle commute: 


XS GPX FX 
P ie th (11) 
GA A, 


One expresses this by saying that 7 = nx is universal among arrows from 
X to an object of the form GA. This also implies that when the functor 
G is given, the object FX is uniquely determined up to isomorphism. In 
other words, given a functor G, its left adjoint F' (if it exists) is unique 
up to natural isomorphism. Also, given G and a universal arrow from 
each object X to some object of the form GA, the left adjoint must 
exist. The naturality condition (7) [or (9)] also implies that the unit 


Categorical Preliminaries 19 


morphisms nx: X — GFX, for varying objects X of X, constitute a 
natural transformation from the identity functor on X to the composite 
functor GF: X — X for @(nx o£) = F(€) = 0(GFE o nx’). 

Dual to the unit of an adjunction is the counit. In the correspondence 
(6), take X = GA and f the identity on GA. The corresponding h is 
written 

e or €4: FGA- A. 


This defines a natural transformation from F'G to the identity functor 
on A. Its universal property is this: to every h: FX — A there exists a 
unique f which makes the following triangle commute: 


x FX 
f\ Ff} io (12) 
GA FGA—> A. 


In other words, € is universal among arrows from an object in the image 
of F to A. As for the unit, this implies that given F, its right adjoint G 
(if it exists) is determined uniquely up to isomorphism. 

In the diagram (12), one may take h to be the identity on A= FX. 
The corresponding map f is then the unit 7x of the adjunction, and we 
obtain a commutative triangle 


G—2.GFG 


ONE 


HGP oo 


as on the left of (13). Its dual is the right-hand triangle. Conversely, 
two natural transformations «: FG — id and 7: id — GF which satisfy 
these two triangular identities (13) serve to make F a left adjoint of G 
[CWM, pp. 80-81]. 

As an example, consider the product category C x C of a given 
category C with itself. (C x C may also be viewed as a functor category 
C?, where 2 is the category with two distinct objects 0 and 1, and 
identity morphisms only.) If the product A x B of any pair of objects 
A and B in C exists, this gives a functor x: C x C > C which is right 
adjoint to the diagonal functor C — C x C sending C to (A, A). This 
follows immediately from the definition of the product. 

Suppose products exist in C. For a fixed object A of C, one may 
consider the functor 

Ax —:C-C. (14) 
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If this functor has a right adjoint (necessarily unique up to isomorphism), 
this adjoint is denoted by 


(=) Cx. (15) 


In this case A is said to be an exponentiable object of the category C, 
while the value B4 of (15) for an object B of C is called the exponential 
of A and B. That (—)4 is right adjoint to A x — means that for any 
objects B and C of C there is a bijective correspondence 


C— BA 


————— 16 
AxC— B’ uo) 


natural in B and C. It follows from the various uniqueness properties 
that B4 is also a (contravariant!) functor of A (at least on those A 
which are exponentiable), and that (16) is also natural in A. The counit 
of this adjunction A x (—) 4(—)4 is a map 


e:Ax BA+B (17) 


with the property that for any map f: C > B4 there is a unique h: Ax 
C — B such that €0 (1x f) =h: 


Ax BA—£—_-B 


. 
lxf Ue 


h 
AxC. 


In this special case, the counit map is called the evaluation and denoted 
by e orev: Ax B4 > B. 

A category C is called cartesian closed if it has finite products (i.e., 
a terminal object and binary products) and if all objects of C are ex- 
ponentiable. For example, the category of sets is cartesian closed: the 
exponential B4 of two sets A and B is simply the set of all functions 
from A to B, and the bijective correspondence (16) is the familiar pro- 
cess of turning a function f: Ax C' — B of two variables into a function 
of a single variable in C with values in BA. 

We now turn to a brief discussion of limits and colimits. Let C 
be a fixed category. For a small category J (the “indexing category” ) 
we consider the functor category CJ. An object of C4 is also called a 
diagram in C of type J. For example, each object C of C determines a 
constant diagram 43(C’) which has the same value C for all 7 € J; this 
defines the diagonal functor 


Acs Cc}. (18) 
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A natural transformation x from the constant diagram Aj(C') to some 
other diagram A of CJ then consists of maps f,: C — Aj, one for each 
“index” 7 in J, all such that the triangle 


C 
vy Ne ui jk, (19) 


A(j) amy AQ), 


commutes for every arrow u of J. Such a natural transformation is called 
a cone f: C — Aon the diagram A with vertex C’. In particular, a cone 
a: L — A with vertex L is universal to A when to every cone f: C3 A 
there is a unique map g: C' — L in C with 1; 0g = f;, for each j of J, 
as in the commutative diagram 


C ---------- foe eats >L 
ee Ba 
fr A(j) Te u:jrk. 
co) 
A(k). 


This universal cone 7: L — A (or, less accurately, its vertex L = lim | A) 


is called the limit of the diagram A. If every diagram A in C4 has a 
limit in this sense, then the diagonal functor A; has a right adjoint 


lim : CJC. (20) 
<—J 


Indeed the counit of this adjunction is precisely the universal cone, which 
can be viewed as a natural transformation 


TW: A;(L) = A;(lim, A) > A, 


For example, if J = 2 = {0,1} is the discrete category with two 
objects 0 and 1 and only identity arrows, then a diagram in C? is just 
a pair of objects of C and a limit of that diagram is just a product 
of these objects. The product is thus a special case of limit. In the 
same way, a terminal object, a pullback, or an equalizer, as discussed 
above, are all special cases of limits (when J is the empty category, or 
the category —+ e —, or ee, respectively, where we have indicated 
only the nonidentity morphisms). 
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The dual of the notion of limit is that of colimit. A cocone with 
vertex C' on a diagram A: J — C is a map A — Aj(C) in the functor 
category C4. The universal cocone on A, if it exists, is called the colimit 
of the diagram A, and its vertex is denoted by lim | A. If the colimit of 


any diagram of type J in C exists, this gives a functor 
lim_:C7—=C (21) 
I 


which is left adjoint to the diagonal Ay: C > C4. 
Now suppose G: C — D is a functor. If J is a small index category, 
G induces a functor GI: C3 — D! of diagrams in the obvious way. If 
limits of type J exist in C and D, one obtains a square of categories and 
functors 
lim 
Cc! 3.4.0 


rend | ie (22) 


The universal property of limits implies that there is a canonical natural 
transformation 
ay: Golim_ = lim_oG. (23) 
—-J J 


One says that G preserves limits (of type J) if ay is a natural isomor- 
phism. A basic property is that G preserves limits of any type if G 
has a left adjoint. Or briefly, right adjoints preserve limits (see [CWM, 
p. 114]). Dually, one defines preservation of colimits. The corresponding 
basic fact is that left adjoints preserve colimits. 

We conclude these preliminaries by mentioning an important fact 
about limits and colimits in functor categories, namely, that these are 
computed pointwise. More precisely, let C and D be categories and con- 
sider the functor category CP. If J is a small category such that limits 
of type J exist in C, then the same is true in CP, and the evaluation 
functor (—)p : CP — C, for any given object D of D, preserves such 
limits. In other words, for a diagram A: J — CP of type J in C?, one 
obtains a diagram Ap: J — C for each object D of D, by setting 


Ap(j) = AGj)(P). 


If every such diagram Ap has a limit Lp = lim Ap in C then these 


limits fit together to give a functor L: D — C, which is a limit for the 
diagram A. So for each D in D 


(lim, A)(D) = lim, Ap, (24) 
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where the limit on the left is taken in CP and that on the right in C. 
The corresponding fact for colimits also holds, and gives an isomor- 
phism 
(lim, A)(D) © lim, Ap (25) 


analogous to (24). For details, we refer the reader to [CWM, p. 112]. 


I 
Categories of Functors 


Many constructions on various mathematical objects depend not just 
on the elements of those objects but also on the morphisms between 
them. Such constructions can thus be effectively formulated in the cor- 
responding category of objects. A “topos” is a category in which a 
number of the most basic such constructions (product, pullback, expo- 
nential, characteristic function, ...) are always possible. With these 
constructions available, many other properties can be efficiently devel- 
oped. Superficially quite different categories, arising in geometry, topol- 
ogy, algebraic geometry, group representations, and set theory, all turn 
out to satisfy the axioms defining such a topos. 


1. The Categories at Issue 


Our exposition starts by describing a number of specific categories 
which are topoi, exhibiting in each one several of the basic constructions 
required. These examples will pave the way to the formulation of the 
axioms for a topos. 

In the following list of many such categories the most important 
examples are those numbered (i), (viii), (x), and (xi): sets, functor 
categories (presheaves), sheaves, and group actions. 

Here is the list of examples of topoi: 

(i) Sets, the category of all (small) sets S, T, and functions S > T 
between them. 

(ii) Sets x Sets, the category of all pairs of sets, with morphisms 
pairs of functions. 

(iii) Sets”, the category of all n-tuples of sets with morphisms all 
n-tuples of functions. Here n is a fixed natural number. 

(iv) BG, or G-Sets, the category of all representations of a fixed 
group G; where a representation of G consists of a set X together with 
a right action wu: X x G— X of Gon X. This action is usually denoted 
simply by a dot, as in u(z,g) = x-g; one requires yz to satisfy the 
identities c-1 = x and (z-g)-h = x- (gh), for all x € X and g, 
h €G. A morphism between two such representations (X, 4) and (Y,v) 


1. The Categories at Issue 25 


is a function f: X — Y which respects the action [in the sense that 
f(x-g) =f (x)-g for allxe € X andg € Gl. 

(v) BM, or M-Sets, the category of all representations X x M — X 
of a fixed monoid M on a variable set X; as in (iv), a morphism of BM 
is a function which respects the action. 

(vi) Sets?, the category whose objects are all functions ¢: X — X’ 
from one set X to a second set X’, with the evident arrows (commutative 
squares) between these objects. 

(vii) Sets , the category whose objects are all sequences X, 


Xp 7 X1 > XQ: 


of sets X,, and functions X, — Xn+41, with the evident arrows X > Y. 
It has been suggested (Lawvere) that such a sequence X be considered 
as a “set through time”, where each X,, is regarded as the state of the 
variable set X at the (discrete) time n. The exponent N here is the 
linearly ordered set of natural numbers, to be regarded as a category, so 
that a sequence X,, is a functor N — Sets. 

(viii) Sets©” where C is a fixed small category and C°P its op- 
posite. This is the usual functor category, with objects all functors 
P : C% —, Sets and arrows P — FP’ all natural transformations 
0: P — P" between such functors. Recall that such a @ assigns to 
each object C' of C a function 0¢: P(C) — P’(C), in such a way that 
all diagrams 

Pc —f_. Pp 


P'C ry a P'D, 
for f: D > C an arrow in C, are commutative. Each object P in this 
category is a contravariant set-valued functor on C; in anticipation of 


Example (x) below, such a P is also called a presheaf on C. In the 
notation of the French school, 


Sets©” = C (1) 
is the category of all presheaves on C. If P is a presheaf on C and 


z € P(C), the value P(f)(zx) for an arrow f: D — C in C is called the 
restriction of x along f, and is often denoted by | or by a dot: 


P(f)(x) =alf =a: f. (2) 


Here f is written after z, because the contravariant character of P is 
then expressed for a composite fog as x-(fog)=(a- f)-g. 
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Each object C' of C gives rise to a presheaf y(C’) on C, defined on 
an object D of C by 


y(C)(D) = Home (D, C) (3) 


and on a morphism D’ pa D, for u: D = C, by 


y(C)(a): Homc(D,C) — Home (D"’, C) 
y(C)(a)(u) = uoa; 


or briefly, y(C) = Homc(—,C) is the contravariant Hom-functor. 
Presheaves which, up to isomorphism, are of this form are called repre- 
sentable presheaves or representable functors. If f: Ci — C2 is a mor- 
phism in C, there is a natural transformation y(C,) — y(C2) obtained 
by composition with f. This makes y into a functor 


(4) 


y: C— Sets©”, CH Home(—,C) (5) 


from C to the contravariant functors on C (hence the exponent C°P). 
It is called the Yoneda embedding. The Yoneda embedding is a full 
and faithful functor. This fact is a special case of the so-called Yoneda 
lemma, which asserts for an arbitrary presheaf P on C’ that there is 
a bijective correspondence between natural transformations y(C) > P 
and elements of the set. P(C): 


9: Homg(y(C), P) —— P(C), (6) 


defined for a: y(C) — P by 0(a) = ac(1c) (see [CWM, p. 61)). 

(ix) Sets/J, the comma category or slice category, with objects all 
sets over the fixed set J. Here, a set over J is by definition a function 
h: X — J from a (variable) set X to J, and with arrows commuting 
triangles as in (8) below. We also think of X (via h) as a set over the 
“base” J. 

(x) Sh(X), the category of all sheaves of sets over a fixed topological 
space X. This important example will be explained in Chapter II below. 

(xi) Let G be a topological group. The category BG of continuous 
G-sets has as objects sets X equipped with a right action p: XxG > X, 
as in (iv), with the additional requirement that this action be continuous 
when X is equipped with the discrete topology. The morphisms are the 
same as those described in (iv). 

(xii) Simplicial sets: a simplicial object S in a category C is a family 
Sp for n > 0 of objects of C, together with for each n two families of 
morphisms of C 


di: Sn — Sp-1, $,: Sn > Sn4i, i=0,...,n 
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(and with n > 0 in the case of d;) which satisfy the identities 


d;d, = dj_1d;, i<j, 


S485 = $5 415i, ts, 
dis, = 83-1dj, i<j, (7) 
= j= pendia jd 


87d;-1, t>j+1. 


In particular, a simplicial object in Sets is called a simplicial set. If 
A, is a “standard” affine n-simplex, a continuous map f: A, - X 
into a topological space X is called a singular simpler for X. Such a 
simplex f has n+ 1 faces djf: A,_; — X determined by restricting 
f to the i* face of A,; also collapsing vertex i to vertex i + 1 gives 
n+1 maps A,4,) — A,: composed with f they yield n+1 “degenerate” 
singular simplices s;f. Taking d; to be the i** face and s; to be the 
ith degeneracy makes the collection of all such singular simplices of X 
into a simplicial set—-one from which the homology, cohomology, and 
homotopy of X can be computed. For many purposes, the category of 
topological spaces may be replaced by the category of simplicial sets. 

(xiii) FinSets, the category of all finite sets and functions between 
them. 

(xiv) FinSets©”” , the category of all functors from C (a fixed finite 
category) to FinSets. 

In this list the decisive types of examples are (i), (viii), and (x): 
Sets, set-valued functors (presheaves), and sheaves. These correspond 
to the major thrusts of our subject, toward the foundation of sets, the 
manipulation of functor categories, and the properties of sheaf coho- 
mology. As a matter of fact, each of the categories (i)—(vii) above is a 
special case of a functor category Sets©”’ —and in each case, the arrows 
of the category in question are precisely the natural transformations of 
functors—for the following choices of the category: 

(i) C is the category 1 with one object and one (identity) arrow. 

(ii) C = 1+1 is the discrete category with two objects (and therefore 
with exactly two arrows, the identity arrows of these objects). 

(iii) C is the discrete category with n objects. 

{iv) C is the group G, regarded as a category with one object, with 
arrows the elements of G, and with composition the product in the group 
G. 

(v) C is the monoid M, regarded in the same way as a one-object 
category. 

(vi) C = 2 is the “arrow category”: The category with exactly two 
objects 0 and 1 and one nonidentity arrow 0 > 1. 
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(vii) C = N°? is the category whose objects are the natural numbers 
n =0,1,2,..., and whose arrows n — m are exactly the pairs (n,m) 
with n > m. 

Extending (vi) and (vii), recall that any ordered or preordered set P 
will yield a category P with objects the elements p € P and arrows the 
pairs (p,q) with p < q in the given preorder. For example, the ordered 
set R of real numbers yields in this way the functor category Sets® 
whose objects are “sets seen through (real) time” (cf. (vii) above]. 

The comma category described in (ix) is almost a functor category. 
An object of this comma category is an arrow h: X — J of Sets, while 
an arrow f: h — h' between two such objects is an arrow f: X > X' 
of Sets such that the triangular diagram 


Ra 
A a ) 


commutes. Each such object h: X — J over J determines a J-indexed 
family { H; | 7 € J} of sets, consisting of the sets 


H, = h-{j} = {a | € X and he = j}, 


and then each arrow f: h — h’ as in (8) determines a J-indexed family 
of functions f;: Hj — Hj, j € J. If we regard the set J here as a 
discrete category (with objects all elements 7 € J and arrows only the 
identity arrows j7 — j), then each J-indexed family of sets is just a 
functor H: J — Sets and each J-indexed family of functions f, is just 
a natural transformation F': H — H' between these functors. In other 
words, the assignments h +> {Hj}, f > {f,} constitute a functor 


L: Sets/J > Sets’, hh {Hj} 


from the comma category to the functor category. 

Reciprocally, each functor H: J — Sets determines a set h: X > J 
over J, with X the disjoint union (the coproduct) X = |] H; of the sets 
H;; for j € J and h the function which sends each x € X into its “index” 
(that 7 € J with xz € H;). These two reciprocal constructions amount 
to constructing two functors D and M 


Sets/J ae Sets’ (9) 


with both LM and ML naturally isomorphic to the respective identity 
functors. Therefore, these two constructions provide an equivalence of 
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the comma category Sets/J to the functor category Sets’. This equiva- 
lence is not an isomorphism of categories because the composite functor 
ML is not the identity, on account of the choice available in the forma- 
tion of the disjoint union involved in the construction of M. (That is, 
many different sets over J correspond under L to the same functor on 
J.) 

The category of simplicial sets is also a functor category: 


SimpSets = Sets”, (10) 


where A is the category whose objects are all the finite ordered sets 
[n] = {0,1,...,2} and whose morphisms [n] — [m] are those maps 
¢: [n] — [m] which preserve the order (i.e., i < j implies di < $j). The 
isomorphism (10) 

comes about because the object [n] in A can be regarded as the 
ordered set of vertices 0,1,...,n of the standard n-simplex A, with i‘ 
face spanned by 0,1,... +i, ...,”, omitting i. Details may be found in 
many sources, for example in [Mac Lane, Homology 1963, p. 233; see 
also VIII §7]. 


2. Pullbacks 
We will make extensive use of pullbacks. Recall that a pullback for 


a diagram X us Be Y in a category C is a commutative square, with 
vertex P, on the edges f and g, as below, which is universal among such 
squares: To any other such commutative square, with vertex Q, on these 
edges, as in (1), 


P oat ore Y- Q ales Y 
a [ go! i (1) 
xX =e B, > re 2) 


there is a unique arrow h: Q — P with fo = f’h and gp = g/h. As usual, 
this universality characterizes the pullback square (when it exists) up to 
isomorphism; its vertex P is called the fibered product, P = X xp Y of 
X and Y (relative to f and g). 

In Sets, the pullback P in (1) always exists and is (isomorphic to) 
the set of all those ordered pairs (2, y) of elements x € X, y € Y with 
fx = gy in B. In particular, if Y is a subset of B and g: Y > B the 
inclusion, the pullback P is (isomorphic to) the inverse image f~! of Y 
in X. If both f and g are inclusions of subsets of B, their pullback P “is” 
the intersection of these subsets. If the set g: Y — B over B is regarded 
as a B-indexed set { G;, }, its pullback P along f is the X-indexed set 
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{G,} with GL = Gyz; in pictures: over each point c € X put a copy 
G', of the set given over fx. Similarly, if g is a fiber bundle of some sort, 
g’ is (in a suitable category) the familiar induced fiber bundle. 


The pullback exists in any functor category Sets®”” and is con- 
structed “pointwise” (as pointed out in the preliminaries): If X, Y, and 
B in (1) are functors to Sets, with f and g natural transformations, 
then P: C°P — Sets is that functor which sends each C' € C to the set 
PC which is the pullback in Sets of XC — BC and YC — BC. In 
other words, (X xg Y)(C) = X(C) x ae) Y(C). 


Consider the pullback P of f with itself [f = g in (1)]. In Sets, P is 
the set of all pairs (2, y) of elements in X with fz = fy; in other words, 
PcX-xX is the equivalence relation which f induces on its domain X. 
In any category, the pullback P of f with f, when it exists, is a parallel 
pair of arrows P= X called the kernel pair of f. In particular, an arrow 
f is monic (= left cancelable) precisely when, up to isomorphism, both 
arrows in its kernel pair are the identity X — X. In particular, any 
functor preserving pullbacks preserves monics. 


In any category, a pullback g’ of a monic g along any arrow is itself 
monic; this may be proved by a simple formal argument. In Sets, it 
is also true that the pullback g’ of an epi is always epi (epi = right 
cancelable arrow); this is evident from the description of the pullback in 
Sets by elements, but there is no simple formal (categorical) argument. 
This property does also hold in all Examples (i)-(xiv), but the common 
reason, as we shall see in Chapter IV, is deeper. 


The one-point set {x} may be characterized (up to isomorphism 
again!) as a terminal object in Sets: To every set X, there is a unique 
function X — {x}. In the same way, each of our categories (ii)—(xiv) ) has 
a terminal object, call it 1; for example, the terminal object in Sets© 
is the functor whose value at every object C is {+}. 


Pullbacks and terminal objects are limits; specifically, a pullback in 
a category C is a limit of a functor (e — e — e) — C, while a terminal 
object in C is a limit of a (the) functor from the empty category 0 into 
C. Recall that a finite limit in C means a limit of a functor J — C 
where J is a finite category. A category C with a terminal object 1 and 
with all pullbacks has all finite limits: It has binary products, since the 
product X x Y can be constructed as the pullback X — 1 < Y, it 
has products of no factors (the terminal 1), and hence it has all finite 
products. The equalizer e of a pair f,g: X 3 Y can also be constructed 
as a pullback, namely, that of the map (f,g): X — Y x Y and the 
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diagonal A: 


Xa Y xy. 


This shows that a category with pullbacks and a terminal object has all 
finite limits because these can be constructed from finite products and 
equalizers [CWM, p. 109}. 

All the categories in our list (i)-(xiv) have finite limits: We have seen 
how to construct the terminal object and pullbacks in Sets, so Sets has 
finite limits, Consequently, so does a functor category Sets© ety since 
limits in Sets” can be computed pointwise, as just pointed out (cf. also 
(24) of the preliminaries]. In particular, this takes care of Examples (i)— 
(ix), and (xii), (xiii), and (xiv). Finite limits in the category of sheaves 
on a topological space X [Example (x) of §1] will be treated in Chap- 
ter II. This leaves the Example (xi) of the category BG of continuous 
G-sets, which is not a functor category. But ifa: X > Bandb: Y —B 
are maps of continuous G-sets, then their pullback X xg Y in Sets has 
an obvious coordinatewise action by G: 


(z,y)-9 =(@-9,y-9) 


for c € X and y € Y with a(x) = b(y), and g € G. This action is 
continuous if the actions on X and Y are each continuous. So X xg Y 
is again a continuous G-set, and this is easily seen to define the pullback 
in the category BG. Since BG has a terminal object (the one-point set 
with its unique action by G), it follows that BG has all finite limits and 
that these limits can all be constructed as limits of the underlying sets. 
In other words, the “forgetful” functor U: BG — Sets which forgets 
the action, U(X, 4) = X, preserves all finite limits. 


3. Characteristic Functions of Subobjects 


In Sets, a subset S C X may be described in two very different ways: 
As the monic function S — X given by inclusion or as a characteristic 
function ¢g defined as usual for elements x € X by 


0, rEeS, 


ds(e) ={ ie ees 


Here we take the values of ¢g in the typical 2-point set {0,1}; it is the 
set of “truth values”, where we have chosen 0 as the value “true”. It is 
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convenient to regard true as the following subobject (the monic arrow) 
from 1 to 2: 


true: 1 = {0} 2 = {0,1}, 0-0. (1) 


With this notation, each subset S can evidently be recovered (up to 
equivalence) from its characteristic function ¢s as the pullback of true 
along ¢s: 


S ——— 1 


"| |e (2) 


X mapo-9 2. 


In this diagram, S — 1 is the unique function from S to the terminal ob- 
ject (one-point set) 1, and 1 — 2 is the fixed monic defined in (1); given 
the monic m, there is a unique ¢ (namely, the characteristic function) 
such that the diagram (2) is a pullback. 

In Section 4, we will see that subobjects in our other typical cate- 
gories have similar characteristic functions, which take values not in 2, 
but in a suitable object O of “truth-values” . 


Definition. In a category C with finite limits, a subobject classifier 
is a monic, true: 1 — Q, such that to every monic S — X in C there is 
a unique arrow ¢ which, with the given monic, forms a pullback square 


S ———> 1 


| [oe (3) 


X ---3--9 0. 


In other words, every subobject is uniquely a pullback of a “univer- 
sal” monic true. 

This property amounts to saying that the subobject functor is rep- 
resentable (i.e., isomorphic to a Hom-functor). In detail, a subobject 
of an object X in any category C is an equivalence class of monics 
m: SX to X (cf. the preliminaries). By a familiar abuse of language, 
we say that the subobject is S or is m, meaning always the equivalence 
class of m. Then, Subc X is the set of all subobjects of X in the cat- 
egory C; this set is partially ordered under inclusion. The category C 
is said to be well-powered when Subc X is isomorphic to a small set 
for all X; all of our typical categories are well-powered. Now given an 
arrow f: Y — X in C, the pullback of any monic m: SX along f 
is a monic m’ : S’>>Y, and the assignment m +> m’ defines a func- 
tion Subc f: Subc X — Subc Y; when C is well-powered, this makes 
Subc: CP — Sets a functor to Sets. Briefly, Sub is a functor “by 
pullback”. 
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Proposition 1. A category C with finite limits and small Hom- 
sets has a subobject classifier if and only if there is an object 2 and an 
isomorphism 

6x: Subc(X) = Home (X,Q), (4) 


natural for X € C. When this holds, C is well-powered. 


Proof: Given a subobject classifier as in (3), the correspondence 
@x sending the equivalence class of each monic S — X to its (unique) 
“characteristic function” ¢: X — Q is a bijection for each X, as required 
for (4). Now Subc(X) is a (contravariant) functor of X by pullback (= 
inverse image); so to prove this bijection natural, we must show that 
pullback along f: Y — X in Subc( —) corresponds to composition with 
f in Home(—,2). This is immediate by the elementary fact that two 
pullback squares placed side by side, as in 


Ss’ ———°. § ———> 1 


a: 


Y >X >Q, 


yield a pullback (rectangle). Since the Hom-sets are all small, the bijec- 
tion (4) also proves C well-powered. 

Conversely, suppose that (4) is a bijection, natural in X. This states 
that Subc : C°P — Sets is naturally isomorphic to Homc(— , 9); that 
is, that the functor Subc is representable [cf. §1(5)|, with representing- 
object ©. As for any such representation, some subobject to: Qo 2. of 
2 corresponds to the identity 1: QQ, while each subobject S— X of 
X corresponds to an arrow ¢: X > 2. By naturality of 0, the diagram 


Sub(Q) Hom(2, 2) Q+-—— 1 
si psec | | 
Sub(X) —— Hom(X,Q), Si— ¢ 


must commute; this states that S = Sub(¢)M, and hence that each 
subobject S is the pullback of Qo along a unique “characteristic function” 
@, as in 

¢’ 


S ——> 


i: ° 
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This diagram is like the definition (3) of a subobject classifier, but it 
remains to show that Qo is actually a terminal object in C. But taking 
S — X in (5) to be the identity X — X gives a map ¢/: X — . If 
there were two maps ¢’, 6”: X — No then, since fp is monic, both the 
squares 


would trivially be Fallbacks: Therefore, by the uniqueness of ¢ in (5), 
tod’ = tod”. As to is monic, this gives ¢’ = ¢”. Hence each object X 
has a unique map ¢’: X — No, so No is terminal. 


As with any representation of a functor, this result proves that the 
subobject classifier of a category, if it exists, is unique up to an isomor- 
phism. 

The idea of a “subobject classifier” is modeled on other “classifying” 
ideas in topology. A decisive example is that of the classifying bundle for 
a Lie group G. As we will subsequently indicate, a G-bundle over X is 
a suitable continuous map 7: E — X of spaces for which G acts on the 
right on E in such a way that we = ze’ for e and e’ in E iff e = e’g fora 
unique element g in the group G. If G-Bund(X) is the (suitably defined) 
set of all G-bundles over some space X, then pullback of a G-bundle E —- 
X along a continuous map f: Y — X yields a G-bundle over Y, and 
this makes G-Bund a contravariant functor of X (again “by pullback”). 
Then a bundle V — B is said to be a universal G-bundle (and B is the 
“classifying space” for G) if every G-bundle E — X can be obtained 
from V — B by pullback along some X — B. For G = Oy the real 
orthogonal group in k variables, there is a famous such classifying bundle 
V. For large n, it is the Stiefel manifold V,,4441,4 whose points are all 
orthonormal k-frames of vectors v,,...,v% in R°+*+!_ The orthogonal 
group O, acts continuously on these frames v1, ..., vg in the evident way, 
so that two frames are equivalent under the action of O, if and only if 
they span the same k-plane. Therefore the projection p: V > V/ O, of 
the Stiefel manifold on its quotient by this action is an O,-bundle, and 
its base space V/ Oy, is precisely the Grassmann manifold Mnik41,% of 
all k-planes in R™+*+!. A standard argument [Steenrod, 1951] shows 
that p is a classifying bundle for principal O,-bundles over n-complexes 
K, in the sense that any such bundle can be obtained from p by pullback 
along a continuous map f: kK — V/ O, which is unique up to homotopy. 

Classifying bundles have played a major role in topology; we shall 
see that classifying subobjects play a similarly decisive role in category 
theory. Later on in this book, we shall see that the analogous idea of a 
classifying topos is central in our subject. 
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4. Typical Subobject Classifiers 


Each of our typical categories (i)—(xiv) has a subobject classifier. We 
will now explicitly construct these classifiers in order to exemplify the 
general notion. 

The classifier true: 1 — 2 for Sets is evidently also a subobject 
classifier for FinSets; indeed, the usual characteristic functions are still 
effective for finite sets. 

In Sets x Sets, an arrow is a pair of functions f: Y — X, f': Y’ > 
X'. The pair of subsets (1 C 2,1 C 2) is a subobject classifier, and 
the characteristic arrow of any subobject (S C X , S’ C X’) is evidently 
just the pair of characteristic functions (¢5: X — 2, dg: X’ > 2) from 
the category Sets. Thus, there are, in 2 x 2, four “truth-values”. The 
corresponding subobject classifier for Sets” has 2” truth-values; as we 
shall see, it is the Boolean algebra of all 2” subsets of n. 

In the category BG = G-Sets of representations of G [Example (iv) 
of §1], an object is an action X x G — X of the fixed group G on some 
set X, and a subobject is just a subset S C X closed under this action 
{ie., s-g € S whenever s € S and g € G). The complement of S in X 
is thus also invariant under this action, so we can still use the ordinary 
characteristic function gs : X — 2 of S, where the subobject classifier is 
the usual map true: 1 > 2, with G acting trivially on both sets 1 and 2. 
Exactly the same argument applies in the case where G is a topological 
group [Example (xi) of §1]. 

For BM [Example (v) of §1], an object is again a right action X x 
M — X of the fixed monoid M on some set X, and a subobject is 
again just a subset S C X closed under this action, but the previous 
characteristic function will not do because the complement of S need 
not be closed under this action. Instead, we may define a function ¢g 
sending each x € X to the set L of all those £€ M with x -£€ S. This 
set L is a “right ideal” of M (a subset of M mapped into itself by the 
right action of M on itself, via right multiplication). Therefore, take 
Q = 2 y, to be the set of all right ideals L of M with action N x M > 12 
defined by L-m = {ke M|m-k € L} for L € Qandm € M. Then, the 
function ¢@g above is an arrow ¢g: X — Q; in particular, it determines 
the given S as the inverse image of the right ideal M. Therefore, the 
subobject classifier is the function trueyz: 1 — Qy which sends the one 
point of the object 1 to the “maximal” right ideal M € Qa,. 

In case M is a group G the only right ideals are G and 9, so this 
OQg reduces to the previous set 2 with trivial G-action. In case M is the 
additive monoid of natural numbers, the right ideals are the empty set 
and the sets of numbers larger than some fixed number n. 

For the arrow category 2 and Sets?, a subset (So ase Si) (Xo = 
X}) is a pair of subsets Sp C Xo, S1 C X, with oSy C Sy. Relative to 
this subset S there are three sorts of elements x of Xo: Those x in Sp, 
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those x ¢ So with ox € S), and those x with ox ¢ S,. Define gox = 0, 1, 
or 2 accordingly. Then, ¢p on So, with the usual characteristic function 
¢, of S; C Xi, is an arrow ¢ = (0, ¢1) to the object Q displayed below, 


Xx: Xo ———_—— X} 
| “| I 00 = 0, o1 = 0, 02 = 1, 
2: {0,1,2} —,— {0,1}, 


in Sets”, and Sp — S) is the inverse image of ({0} ag {0}) =159. 
In brief, this characteristic function ¢ = ( do, d) ) is that arrow which 
specifies whether “z is in S” is “true” always, only at 1, or never. One 
may say that ¢ gives the “time till truth”. 
For Sets, a subobject of X has the form of a sequence S of subsets 


S: So —-—> Sy ————> Sg. ——— 83... 
X Xo o +X =e > Xo o +X... 


with oS, C S41; for example, if X;, is constant and each o = 1, this S 
is a monotone increasing sequence of subsets. For any z € X, we can 
then measure the “time till truth” (the time till inclusion in S') by the 
function ¢, on X,; defined as 


Oz = the least n with ox € Skin, if such exists, 


= oo otherwise. 


Then ¢,: X~ — N+ {oo}, so the sequence of these maps @, is an arrow 
to the sequence of sets 


Q: N+ {oo} + N + {co} 4 N+ {co} —-.. (1) 


where each 7 has 7(0) = 0, r(n +1) = n for n # 0 and r(oo) = ~@w. 
Then, 2 € SetsN has 1: {0} > {0} — {0} > --- as subobject, and the 
given S is the pullback of 1 along ¢. In brief, “time till truth” provides 
a subobject classifier 2. 

For an arbitrary small category C, a subfunctor of P: C°P — Sets 
is defined to be another functor Q: C°P — Sets with each QC a sub- 
set of PC and each Qf: SD — SC a restriction of Pf, for all arrows 
{:C — D of C. The inclusion Q — P is then a monic arrow in 
the functor category Sets©”, so that each subfunctor Q is a subob- 
ject. Conversely, all subobjects are given by subfunctors; if a natural 
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transformation 6: R»—> P is monic in the functor category, then each 
function 6C': RC — PC is an injection (monics, like limits in the func- 
tor category, are taken pointwise). For each C’ let QC be the image of 
RC — PC; thus Q is manifestly a subfunctor of P, and the given R is 
equivalent (as a subobject) to Q. 

For an arbitrary presheaf category C= Sets©”” if there is a sub- 
object classifier , it must, in particular, classify the subobjects of each 
representable presheaf yC = Homc(— ,C): C°P — Sets. Therefore, 


Subg(Homc(— ,C)) = Homg(Home( — , C), 2) = Nat(Home(— ,C), 2 


By the Yoneda lemma [see §1(6) above], the set on the right is (up to 
isomorphism) Q(C’). Thus the subobject classifier 2, if it exists, must 
be the functor 2: C°P — Sets with object function 


O(C) = Subs (Home( — , C)) 


= {S |S asubfunctor of Homc(—,C) }, (2) 
and with a suitable mapping function. 

To understand this it is customary and useful to use an alterna 
tive terminology for subfunctors of a representable functor Hom(-— ,C). 
Given an object C in the category C, a sieve on C (in French, a “crible” 
on C’) is a set S of arrows with codomain C' such that 


f € S and the composite fh is defined implies fh € S. 


If we think of the arrows f € S as those paths which are “allowed to 
get through” to C, this definition means that any path to some other 
B followed by an allowed path from B to C is allowed. For example, 
if the category C is a monoid M, a sieve is just a right ideal in M; if 
the category C is a partially ordered set regarded as a category, a sieve 
on C €C is aset S of elements B < C such that A < B € S implies 
AeéS: If B “goes through” the sieve, so does anything smaller: a sieve 
is a “downwards closed” subset. 
Now if Q ¢ Homc(—,C) is a subfunctor, the set 


S={f | for some object A, f: A— C and f € Q(A)} 
is clearly a sieve on C’. Conversely, given a sieve S on C, the definition 
Q(A) ={f | f: A> C and f € S$} C Home(A,C) 


yields a functor Q: C°P — Sets which is a subfunctor of the Hom- 
functor Homc(—,C). The passages S to Q and Q to S are reciprocal; 
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hence, we can identify sieves and subfunctors in any locally small cate- 
gory C. Thus, 


Sieve on C' = Subfunctor of Homc(—, C). (3) 


Moreover, for any arrow g: B — C’,, a subobject Q of the functor 
Homec(-— ,C) determines a subobject of Home( — , B) by pullback along 
g, and similarly each sieve S on C' determines the following sieve on B: 


S-g={h|goheS}. 


With this motivation, the proposed subobject classifier 9 for the 
functor category Sets© ” is defined on objects by 


2(C) = {S| S is a sieve on C in C} (4) 
and on arrows g: C’ — C by 
(—)-g: AC) > AC), —S-g={h|gohe S}. (5) 


For an object C of C, the set t(C) of all arrows into C is a sieve, called 
the mazimal sieve on C’. These maximal sieves patch together to give a 
morphism (natural transformation ) 


true: 1-0 (6) 


in the presheaf category Sets©’”. 

To see that (6) defines a subobject classifier in Sets©”” , consider any 
subfunctor @ of a given functor P: C°P — Sets. Then each morphism 
f: A — C in C determines a function P(f): P(C) — P(A) in Sets 
which may or may not take a given x € P(C) into Q(A) C P(A). Fora 
given x € P(C) set 


gc(x) ={f|x-f € Q(dom(f)) }, (7) 


where f ranges over all morphisms in C with codomain C. Then ¢c¢(z) 
is a sieve on C, and ¢: P — Q is natural. Moreover, ¢c¢(z) is the 
maximal sieve ¢(C’) iff x € Q(C), so the given subfunctor Q C P is the 
pullback along ¢ of the map “true” defined in (6) above. 


Ql 


| [ow (8) 
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This shows that ¢ is indeed a possible characteristic map for the 
subfunctor Q. But this ¢ is also the unique natural transformation 
6: P > Q making this diagram into a pullback. Indeed, given x € P(C) 
and f: A — C, the pullback condition means that z- f € Q(A) iff 
64(x-f) = true,; by naturality of 0, this is equivalent to 6¢(x)-f = true, 
and this, in turn, by the definition (5), means that f € 0c¢(x). The ele- 
ments f of O¢(x) are thus exactly those f with z- f € Q(A), i-e., those 
f € ¢c(2) as defined in (7). Thus, the definition (7) of ¢ is forced 
upon us if (8) is to be a pullback. Hence, we have shown that the mono 
true: 1 — 2 defined in (6) provides a subobject classifier for the presheaf 
category C = Sets” 

Intuitively, the sieve ¢¢(z) considered in (7) is the set of all those 
paths f to C which translate the element x of P(C) into the subfunctor 
Q. As the set of “paths to truth”, it clearly agrees with the characteristic 
arrows we have already constructed for the special functor categories 
Sets”, Sets”, and Sets. 

We have assumed C small because we must. Were C large—say the 
ordered set of all small ordinal numbers—the number of paths to truth 
would not in general be small, hence not an object of Sets. 

The exhibition of subobject classifiers for our typical categories is 
completed by noting, for any set J, that the projection J x 2— Jisa 
classifier for the category Sets/J, while in FinSets©” with C finite, the 
set 2(C) of sieves on C is again finite so provides a suitable subobject 
classifier 2. 

Observe, however, that there are many “reasonable” categories with 
no such subobject classifier. The category (FinSets)‘ provides an im- 
mediate such example, because in the linearly ordered set N°P, the num- 
ber of sieves on each object n is infinite. Another example is the category 
Ab of all (small) abelian groups. For, the terminal object 1 in Ab is 
the zero-group, so the group homomorphism true: 1—1 must send 0 
to 0 € Q, and thus its pullback along any ¢: A — 1* is the subgroup 
S = Ker ¢ = ¢~1(0) of A. This implies that the proposed subobject clas- 
sifier must be an abelian group which contains a copy of every quotient 
group A/S of every group A, an absurdity. 


5. Colimits 


Each of our typical categories has all finite colimits. To show this it 
suffices (as in the dual case of finite limits discusssed in §2) to observe 
that each has an initial object 0 and pushouts (or cocartesian squares, 
as they are sometimes called). In Sets, the empty set @ is an initial 
object because there is ie each set X exactly one function § — X; in 
a functor category Sets©’” the constantly empty functor is initial. And 
for a topological group G, the empty set (with its unique action by G) 
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is clearly initial. So all the examples of §1 have initial objects [we leave 
to the next chapter the discussion of sheaves, listed as (x) in §1]. 

Next, we consider pushouts. In Sets or in FinSets the pushout of 
two functions f and g with a common domain X, as in the diagram 


xX —— 


i ! (a) 


v 


ee >Q = YIIZ/{ f(x) = 9(2)}, 


is the set Q which is obtained from the disjoint union of Y and Z by 
identifying the elements f(x) and g(x), for all 2 € X. This quotient 
set has the usual universal property of a pushout. By this universal 
property, if in (1) X, Y, and Z are sets over some fixed set J (as in Ex- 
ample (ix) of §1), so is their pushout; therefore, this same construction 
yields pushouts in the comma category Sets/J. And similarly, if f and 
g in Diagram (1) are maps of G-sets for a group or monoid G [Exam- 
ples (iv) and (v) of §1], or maps of continuous G-sets for a topological 
group G [Example (xi)|, then the quotient Q can again be equipped with 
an action by G making Q into the pushout in the category of G-sets or 
continuous G-sets. Furthermore, if f and g in (1) are natural transfor- 
mations of functors X, Y, Z: C°P — Sets, then the pointwise pushouts 
Q(C) for each object C of C form (again) by universality a functor 
Q: C°P — Sets, which is the pushout in the functor category Sets©””. 
This is a special case of the fact that colimits in functor categories can 
be computed pointwise: If H: J > A© for categories J, A, and C, then 
its colimit lim | H in the functor category AC is given by 


(lim, H)(C)=lim H(C) (CeO), (2) 


where H(C): J — A is the functor obtained from H by evaluating at 
the object C of C [cf. (25) of the preliminaries]. This applies also when 
J is finite, as in our Example (xiv). 

This shows that our typical categories have finite colimits. [In fact, 
except for (xiii) and (xiv), they all have arbitrary small colimits, and in 
this sense are cocomplete, but that need not concern us here.| Our typi- 
cal categories have many other common formal properties. For example, 
each morphism f has an epi-mono factorization f = m-e. However, these 
other common properties will all be deduced (in Chapter IV) from the 
ones we have previously examined. In the deduction, limits will play a 
much more important role than colimits. 

To conclude this section, we wish to mention a useful fact concerning 
colimits in functor categories: 
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Proposition 1. In a functor category Sets”, any object P is the 
colimit of a diagram of representable objects, in a canonical way. 


This proposition asserts, in other words, that given a functor 

P: C°P — Sets, there is a canonical way of constructing a small “index” 

category J and a diagram A: J — C in C of type J, such that P is iso- 
A y oO 

morphic to the colimit lim (y o A) of the diagram J — C — Sets© ae 


obtained by composition with the Yoneda embedding described in (1.5). 

Given P, the index category J which serves to prove the proposition 
is the so-called category of elements of P, denoted by te P or, more 
briefly, { P. Its objects are all pairs (C,p) where C is an object of C 
and p is an element p € P(C). Its morphisms (C’, p’) > (C, p) are those 
morphisms u: C’ > C of C for which pu = p’; in other words, u must 
take the chosen element p in P(C) “back” into p’ in P(C”): 


(C’,p') = (C,p) by u: C’ >C with pu =p’. (3) 


These morphisms are composed by composing the underlying arrows u 
of C. This category has an evident projection functor 


Tp: [P- C, (C,p) eC. (4) 


Colimits over the category of elements can be used to construct a 
pair of adjoint functors which will have many uses, as follows. 


Theorem 2. If A: C — € is a functor from a small category C to 
a cocomplete category €, the functor R from € to presheaves given by 


R(E): C + Hom, (A(C), E) (5) 


has a left adjoint L: Sets©’” — € defined for each presheaf P in Sets©” 
as the colimit 


Tp A 
L(P) = Colim( i, poe e), (6) 
In other words, there is a pair of adjoint functors L 4 R, as in 
L: Setsc”” ee sR; (7) 


where we place the left adjoint L on the left. 


Proof: A natural transformation 7: P — R(E) is just a family 
{ 7c } indexed by objects C of C for which each 7c is a map 


tc: P(C) ~ Home(A(C), E) 
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of sets which is natural in C, in the sense that the diagram of sets 
P(C) —“2— Hom¢(A(C), E) 
Peo] [aco (8) 
P(C’) —z5-— Hom¢(A(C"), E) 


commutes for each morphism u: C’ — C' of C. But such a7 may also 
be considered as a family of arrows of E 


{to(p): A(C) > E}ep) (9) 


indexed by objects (C,p) of the category f P of elements of P. In this 
view, the condition (8) then means that the following diagram 


A(C) = Anp(C,p) 


To(p) 


A(u) Ux E (10) 


Za 
A(C’) = Amp(C’,p’) 
commutes for each arrow u. This visibly means that the arrows 7c(p) 
constitute a cocone from the functor Amp to the object E. By the defi- 
nition of a colimit, each such cocone comes by composing the colimiting 
cocone (to lim) with a unique arrow from the colimit LP to the object 
E. In other words, there is a bijection 


Nat(P, R(E)) = Home(LP, E). (11) 


Since this bijection is clearly natural in P and in E, it asserts that L is 
a left adjoint to R, just as claimed. 


Corollary 3 (= Proposition 1). Every presheaf is a colimit of 
representable presheaves. 


Proof: In the theorem, take € to be the presheaf category and A 
to be the Yoneda embedding 


Az=y:C-— Sets®” =C=€. 


By the Yoneda lemma the definition (5) of the right adjoint R for any 
E = P is then 


Ra(E)(C) = Homa (y(C), £) = E(C); 
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this states that R,4 is isomorphic to the identity functor of C. By the 
uniqueness up to isomorphism of adjoints, its left adjoint ZL must then 
also be isomorphic to the identity functor, so that the definition (6) of 
L gives for any presheaf P 


Pe Cotim( f P 2 Os), (12) 


One may prove the result directly, by constructing a colimiting cone 
from L(P) in (6), with A= y, to P. 

The Yoneda embedding of C in the presheaf category C is the “uni- 
versal” way of making C cocomplete, in the sense that every functor 
A from C to a cocomplete category € factors through y by a unique 
colimit preserving functor from presheaves to €, as follows. 


Corollary 4. For each functor A: C — € from a small category 
C to a cocomplete category € there exists a colimit preserving functor 
L: Sets©” — € for which the following diagram (with the Yoneda 
embedding y) commutes 


| ae (13) 


The functor L with these properties is unique up to isomorphism, and 
can be defined as in (6) by a colimit. 


Proof: It will suffice to prove that L of (6) makes the diagram (13) 
commute; then L, as a left adjoint, preserves colimits. Moreover, since 
every presheaf P is a colimit of representable presheaves, L is unique up 
to isomorphism, as asserted in the corollary. 

To prove that (13) commutes, note that when P = Hom(—,C) = 
yC is representable the corresponding category of elements { P has a 
terminal object—the element 1: C — C of P(C). Therefore the colimit 
of the composite A op will be just the value of Ao 7p on the terminal 
object. Hence, 

Ly(C) = Arp(C,1¢) = A(C) 


so the diagram does commute. 


The process C +> G is a functor from Cat, the (large) category of 
all small categories, to Cocomp the (“superlarge”) category of “all” 
cocomplete categories, with morphisms all colimit preserving functors. 
This corollary states in effect that the Yoneda embedding provides uni- 
versal arrows and so, like universal arrows generally, constitute the units 
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of an adjunction—an adjunction in which C C is left adjoint to the 
forgetful functor Cocomp — Cat —forget cocompleteness. This sug- 
gestive formulation stumbles on the fact that cocomplete categories are 
hardly ever small [CWM, p. 110], so do not become small by forgetting 
the colimits! 

It is convenient to picture the category { P of elements of P in terms 
of its projection 7p by a diagram 


[2 pf -----9 pe P(C) 


C —_>; Sets, 


(14) 


f--=e4 


! 
Cc’ —— C. 


(Here the object function of the functor P is a set indexed by the set 
Co of objects of C, and the objects of { P form the corresponding set 
“over” Co, in the sense described in §1: J-indexed sets ~ sets over J.) 
In this diagram (14) the inverse image under z of an object C' of Co is 
the set of all p € P(C), while the projection 7 has the property that 
for each such x € P(C) and each u: C’ > C there is a unique pair p’, 
u’: (C’,p’) > (C,p) with rp’ = C’, r(u’) = u. Any functor 7: E> C 
with this latter property is called a fibration of categories; in geometric 
terminology, given a point p over C each arrow in the base with target 
C lifts uniquely to an arrow “upstairs” with target p. 

A construction similar to (14) may be carried out when Sets is 
replaced by the category Cat of all small categories; it is often called 
the Grothendieck construction, but the case of { P above was first done 
by Yoneda and developed by Mac Lane well before Grothendieck. 


6. Exponentials 


The basic arithmetic operations on numbers and on sets are +, times, 
and exponent. We have already described + and times categorically as 
coproduct and product, respectively; we now consider the exponent Z*. 
In Sets, this Z* is the usual “function set”, consisting of all functions 
h: X — Z. It may be described by the familiar bijection 


Hom(Y x X,Z) > Hom(Y, Z*) (1) 


which sends each function f: Y x X — Z of two variables into the 
function f’: Y > Z* where, for each y € Y, f’y € Z* is the function 
with (f'y)(z) = f(y,z) € Z. This bijection (1) completely determines 
Z* up to isomorphism; for setting Y = 1 yields Z* ~ Hom(1, Z*) = 
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Hom(X, Z). The bijection (1) is natural in Y, X, and Z and so states 
that the functor (— )* is the right adjoint of — x X: Sets — Sets. 

Now consider any category C with finite products. Then for each 
object X of C the evident assignment Y + Y x X determines a functor 
— x X:C — C, called “product with X”. When this functor has 
a right adjoint, written Z + Z*, we say that C has an exponential 
for X; this means that there is a bijection (1) natural in the objects 
Y and Z of C. When this holds for all objects X, it implies, by the 
“parameter theorem” for adjunctions [CWM, p. 100], that (X,Z) 1 
Z* is a functor C°P x C = C called the exponential for the category 
C. 

The existence of the adjunction (1) can be stated in elementary terms 
(ie, without using Hom-sets). For, set Y = Z* in (1); the identity 
arrow 1: Z* — Z* on the right in (1) then corresponds under the 
bijection to an arrow e = ez,x, 


e: Z* xX oZ (2) 


called evaluation; in Sets this arrow e is the actual evaluation e(h, rz) = 
h(x) of the function h: X — Z at the argument x € X. The bijection 
f + f' of (1), by naturality, now becomes the statement that to each 
f:Y x X > Z there is a unique f’: Y — Z* such that the diagram 


YxxX 
fix f 


‘ 
v 


OX = 97 


commutes. One also says that e is universal from — x X to Z. Also, e 
is the counit of the adjunction (1); the adjunction can also be described 
in terms of its unit 7: Y — (Y x X)*, as explained in the preliminaries. 

The evaluation e = ez,x of (2) is natural in Z and “dinatural” in 
X, The latter notion applies because the domain of e is a bifunctor in 
X contravariant in the first X and covariant in the second; dinaturality 
means (see [CWM, p. 214]) that for every arrow t: X — X’' the diagram 


ZX yx XM*L, Xx x 
| I 
BA OS 


commutes. One also has natural isomorphisms Z**¥ ~ (ZY)* and 
Z} & Z, with the evident values of the respective evaluations. 
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Recall that a category C is cartesian closed when it has a terminal 
object 1 and a binary product X x Y for any two objects X and Y, as well 
as exponentials Y* (with their evaluations, for all objects X and Y). 
For example, the category Sets of all small sets and the category Cat 
of all small categories are both cartesian closed. Also, any product of 
cartesian closed categories is cartesian closed: both finite products and 
exponentials can be computed “termwise” in such a product category. 

In any cartesian closed category, there are natural isomorphisms 


1* ~1, Xi 2x, (3) 
VRQ ayY* xg, DOSS ey (4) 


The last equation of (4) follows from the definition of the exponential 
and the associativity of the product; the first of (4) holds because (—)* 
has a left adjoint, hence preserves products. 

All our typical categories are cartesian closed. The case of sheaves 
will be discussed in the next chapter. The following proposition takes 
care of most of the other cases. 


Proposition 1. For any small category C, the functor category 
Sets©”” is cartesian closed. 


To prove this proposition, recall that the product of two functors P 
and Q: C°P — Sets is their pointwise product. However, we cannot use 
a “pointwise” exponential Q?(C) = Hom(PC,QC) because the right- 
hand side here is not a functor of C' in any reasonable way. 

To find a formula for the exponential, we first assume that an expo- 
nential Q? exists, so that Hom(R x P,Q) ~ Hom(R, Q”) for all R. In 
particular, for each representable functor R = Homc(-— ,C) = yC, this 
isomorphism composed with the Yoneda isomorphism gives 


Q?P(C) = Homa (y(C), Q’) 
= Home(y(C) x P,Q). 


Now drop the assumption that Q? exists, but use this result to define 
Q? as the functor 


Q?(C) = Homg(y(C) x P,Q); (5) 


ie, Q?P(C) is the set of all natural transformations @ from 
Homc(—,C)x P to Q. This clearly defines a functor Q? : C°P — Sets. 

Associate with this definition (5) a putative evaluation map e: QP x 
P = Q with components 


ec(9,y) = 9c(1e,y) € Q(C) (6) 
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for C € C, 6: Home(—,C) x P — Q and y € P(C). It follows that e 
is a natural transformation. Moreover, to every natural transformation 
@: R x P = Q one can find a (unique) ¢’: R — Q? such that the 
diagram of natural transformations 


RxP 


xa SS (7) 


Q? x P—z>Q 


is commutative. Specifically, for C € C and u € RC, we must de- 
fine an element ¢4(u) € Q?P(C), that is, a natural transformation 
é¢(u): Home(—,C) x P > Q. We define the components (¢¢(u))p 
for f: DC and z € P(D) in terms of ¢ by 


(¢c(u))p: Home(D,C) x P(D) > Q(D), 
(f,2) > dp(u- f,z). (8) 


The ¢@’ so defined is clearly natural in D. Moreover, by the definition 
(6) of the evaluation e, for u € R(C) and y € P(C), 


ec(¢o(u), ¥) = (¢e(4))c(1e, y) 
= bc(u, y), by (8). 


This means that the triangle (7) commutes, and that this condition 
plus naturality forces our definition of ¢’. Therefore Q? is the required 
adjoint and Proposition 1 is proved. 

A somewhat different description of the same exponential appears 
as Exercise 8. The meaning of the formula (5) for the exponential Q? is 
also illuminated by the special case in which the category C is a monoid 
or a group (Exercise 5). 

It follows from Proposition 1 that most of our typical categories are 
cartesian closed. [Examples (xiii) and (xiv) of §1 are similar to (i) and 
(viii).] Besides the case of sheaves, which will be discussed in the next 
chapter, this only leaves the case of continuous G-sets for a topological 
group G; a construction of exponentials in this category is outlined in 
Exercise 6. 

A global section of a functor P: C°P — Sets is defined to be a 
function y which assigns to each object C of C an element yo € P(C) 
in such a way that the equation 


yo-f=yp, ff: DC, (9) 


holds for every arrow f of C. Thus 7¥ is just a natural transformation 
y:1— P, where 1 is the constant functor 1 on C°P. (The geometric 
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origins of the term “global” will appear in Chapter II below.) The set 
I'(P) of all global sections y of P yields a functor 


T: Sets©” — Sets. 


In the opposite direction, the constant presheaf functor A assigns to 
each set S the functor AS with (AS)(C) = S and every (AS)(f) the 


identity. For each S and P there is a natural isomorphism 
Homg(AS, P) = Homsets(S, TP) (10) 


since a natural transformation AS — P simply assigns to each element 
s € S its image, a global section 1 — P of P. Therefore, the functor A is 
left adjoint to the global sections functor [. This adjunction (where the 
left adjoint A is left exact) is a first instance of what will later be called 
a “geometric morphism” (Chapter VII). Also, a natural transformation 
AS — P is just a cone from the set S to the functor P to Sets. Hence 
(10) states also that IP is exactly lim P. 

We can now summarize the common properties of our typical cate- 
gories. They are categories € with the following properties 


(i) € has all finite limits and colimits, 
(ii) € has exponentials, 
(iii) € has a subobject classifier 1 > 2. 


A category € with these properties will be called an elementary topos; 
in brief a topos (plural: topoi). Each topos is, in particular, a cartesian 
closed category. 


7. Propositional Calculus 


The propositional calculus considers “Propositions” p, q, r,... com- 
bined under the operations “and”, “or”, “implies”, and “not”, often 
written as pA gq, pV q, p > q, and —p. Alternatively, if P,Q, R,... are 
subsets of some fixed set U with elements u, each proposition p may be 
replaced by the proposition u € P for some subset P ¢ U; the proposi- 
tional connectives above then become operations on subsets; intersection 
A, union V, implication (P > Q is ~P AQ), and complement of subsets. 
These four operations satisfy various identities, so that the subsets of 
U under these operations constitute a Boolean algebra. In this way a 
Boolean algebra is the algebraic correlate of the classical propositional 
calculus. If, instead, one takes the intuitionistic propositional calculus, 
as formalized by Heyting, one obtains a different algebraic system on 
the same operations A, V, =>, =; such a system is known as a Heyting 
algebra. The typical model is not the set of all subsets of some set, but 
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the set of all open subsets of some topological space X; in the model, 
the operations A and V still correspond to intersection and union, re- 
spectively, but the other two operations must be reinterpreted (in order 
to give open sets). Thus U = V is the largest open set W such that 
WAU CY, while -U is the interior of the complement of U (the largest 
open set disjoint from U). 

We now formulate the exact definitions, beginning with the notion 
of a lattice. A lattice D is a partially ordered set which, considered as a 
category, has all binary products and all binary coproducts. If we write 
zx, y, and z for objects of L, then x < y if and only if there is a (unique) 
arrow z — y, the coproduct of z and y is the least upper bound (or 
sup) z V y and the product is the greatest lower bound (or inf) x A y. 
If a lattice L has elements 0 and 1 such that 0 < x < 1 for all x € L, 
then 0 and 1 are the (unique) initial and terminal objects, respectively, 
of L, considered as a category. Thus a lattice with 0 and 1 is a partially 
ordered set which, considered as a category, has all finite limits and all 
finite colimits. 

A lattice with 0 and 1 can also be defined equationally, as a set with 
two distinguished elements 0 and 1 and two binary operations V and A, 
both of which are both associative and commutative and which satisfy 
the added identities 

ctAr=a2, cVr=a, 
lAg=a, OVae=z, (1) 
rA(yVz)=r2=(rAy)Ve. 
These equations on the operations 
A: LxLoL, Vv: ExLbotLl, 0,1:130L 


can be used to define a “lattice object” DZ in any category C with finite 
products. Here they follow from the above definitions of A and V in 
terms of the partial order. And, given these equations, the partial order 
can be recovered because x < y holds in L if and only if c = x A y (or, 
equivalently, y = x V y). 

A distributive lattice L is a lattice in which the identity 


tA (yVz)=(t&Ay)V(eAz) (2) 
holds for all x, y, and z. This identity implies the dual distributive law 
gV(yAz)=(r2Vy)A (eV 2z). (3) 


For, the right-hand side of (3) expands by (2) and then (1) to give, by 
associativity, 
(cVy)A(@Vz)=[@V¥)Aa]V[(evy)Az] 
=aV((zAz)V(yAz)j 
= fr v(z@Az)lV(¥Az)=2V(yAz). 
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A complement for an element xz in a lattice L with 0 and 1 is an element 
a € L such that 


rAa=0, zrVa=1. (4) 


In a distributive lattice a complement a, if it exists, is unique. For let b 
be another complement to x. Then 


b= bA1L=bA(cVa)= (bAz)V (BAG) 
= (x Aa) V (bAa) 
=(rVb)Aa=a. 


We denote the unique complement a of x, when it exists, by a = 2. 
A Boolean algebra B is a distributive lattice with 0 and 1 in which 
every element x has a complement —2; thus, 


xrAn7xz = 0, gVrz=1. (5) 


One may readily verify the additional properties 


“(e@Vy)=72A-y, 7(z Ay) = (72) V(-y), (6) 
ae = 2: (7) 


The identities (6) are called the DeMorgan laws. 

The partially ordered set of all the subsets of a given set is always a 
Boolean algebra. A basic theorem due to M. H. Stone asserts that every 
Boolean algebra is isomorphic to an algebra of some of the subsets of 
some set U. We will use this theorem in Chapter IX. 


8. Heyting Algebras 


A Heyting algebra H (also called a Brouwerian lattice) is a poset 
with all finite products and coproducts which is cartesian closed (as a 
category with products). In other words, a Heyting algebra is a lattice 
with 0 and 1 which has to each pair of elements x, y an exponential 
y*. This exponential is usually written as x = y; by its definition it is 
characterized by the adjunction 


z<(e@=>y) ifandonlyif zAr<y. (1) 


In other words, x => y is a least upper bound for all those elements z 
with z Ax < y; in particular, then, y < (x > y). Thus, in the usual 
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picture of a partially ordered set, x => y lies above y, 


2 OL 
‘7 


but only so far above that its intersection with z is still ¢ Ay. For 
example, for any topological space X the set Open(X) of all open sets 
in X is a Heyting algebra: It is a lattice (under inclusion) because binary 
unions and intersections of open sets are open, as are the sets @ and X. 
For two open sets U and V the exponential U = V can be defined, as 
suggested by (1), as the union [J W, of-all those open sets W; for which 
W;NU c V. Then, because intersection is distributive over arbitrary 
unions, 


(Jwiynu =(Winv) cv. 


Therefore, JW; = (U > V). 

A similar argument will show that any complete and (infinitely) dis- 
tributive lattice is a Heyting algebra. Here a lattice is said to be complete 
when, regarded as a category, it has all small limits and small colimits, 
Le., all small products and coproducts. 

In a Boolean algebra, for all x, y, and z, 


z<(-tVy) ifandonlyif zAr<y. 


Proof, only if: zAz <(-r Vy)Ar<yAe <y; iff z=zAl= 
zN\(n@V 2) = (zAna)V(zAz) <arVy. 
Hence a Boolean algebra has exponentials given by 


(x >y)=r72Vy. 


This is the classical definition of material implication > (for proposi- 
tions, p => gq is “not p or gq”). Therefore, every Boolean algebra is a 
Heyting algebra. The converse does not hold; for example, the open sets 
in the real line form a Heyting algebra which is not Boolean (because 
the complement of an open set need not be open). 

Now we examine the identities valid in any Heyting algebra. For any 
cartesian closed category with objects X, Y, the unit and the counit of 
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the basic adjunction defining the exponential are natural transforma- 
tions 
XA (XxY)’, Yo aX, 


For a Heyting algebra these become the inclusions 
e<(y>(cAy)), yAyra)<e. (2) 
The properties 1* ¥ 1 and X! & X of the exponential become 
(x > 1)=1, (l>s)=zc. (3) 
Since the functor x => (— ) is a right adjoint, it preserves products, so 
(2 = (yA z)) = ((c > y) A(@ = 2), (4) 


while the associative law for the product of objects implies XY*? & 
(XY)? which becomes 


(yA 2) > 2) =(z> (y>2)). (5) 
Also — Ay is a left adjoint, so must preserve coproducts, as in 
((x Vz) Ay) = (Ay) V (2 Ay). (6) 


This means that the underlying lattice of any Heyting algebra is dis- 
tributive. 

Intersection in any lattice is commutative. For a Heyting algebra H 
this means that, for all x, y, and z, 


z2<(@>y)jiffzAr<yiffeAz<yiffa<(z>y). 


Now — => y (like any exponential) is a contravariant functor in the 
argument —. Hence, in this display we may regard — => y on the right 
as a functor from H to H°? and on the left as a functor from H°? to H; 
the equivalence then asserts that the first — = y is left adjoint to the 
second — => y. Since any left adjoint preserves coproducts, this means 
that — => y carries coproducts to products (coproducts in H°P), as in 
the identity 

(x Vz) => 9) = (x > y) A(z > y))- (7) 
If one interprets x, y, and z as propositions, with A as “and” and V as 
“or”, all the equations (2) through (7) become familiar properties of the 


implication relation => for propositions. 
In any Heyting algebra we define the negation of x as 


an = (x = 0). (8) 
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Thus “not xz” means “x implies falsity” or “x implies absurdity”. In 
view of the definition of =, this can be rewritten as 


y<s-e iff yArc=0. (9) 


In other words, in a complete lattice =x is the union of all those y which 
meet z in 0. For example, in the case of the Heyting algebra Open(X) of 
all open subsets of a topological space X, the negation 4U is the union 
of all open subsets of X which do not meet U, so is the interior of the 
set-theoretic complement of U; that is, the set-theoretic complement of 
the closure of U. Thus —-U is the interior of the closure of U, which 
may be larger than U’, as for example when U is a suitable open subset 
of the line or the plane. This example shows that -—z need not equal z. 
Moreover, 2 is not necessarily a complement of x; though x A 7x = 0, 
it may not be the case that 2 V 72 = 1. 
Some of the familiar properties of negation still apply, as follows. 


Proposition 1. In any Heyting algebra H, 


x<a7r, x < y implies sy < 72, (10) 
ar = 777, (11) 
a-(z Ay) = a2 A ay. (11’) 


Proof: Since x A -z = =x Ax = 0, the first follows by (9). The 
second of (10) states that —: H — H°P is a functor. Explicitly, if x < y, 
then cA -y < yA-y = 0, so ~y < 72, again by (9). This result and « < 
—3r gives 7772 < 72, while x < 472 holds for all x, hence for sz, and 
so gives —z < 4772. Hence, (11) holds. Furthermore, two applications 
of (10) to the inequality cA y < x yield =7(x% Ay) < 772. Similarly, one 
derives —7(xAy) < —7y; therefore, =-(xAy) < -72/A--7y. To show the 
converse inequality 7x2 A —7y < =7(x@ Ay) we use the commutativity 
and associativity of the meet A, together with (9) and (11), as follows: 


a8 A aay < A(2 A y) iff a-e@As>yA-7(t@Ay)=0_ by (9), 


iff ay A(t Ay) < a7 by (9), 

iff sy A(z Ay) <7 by (11), 

iff —ayA-(r@ Ay) Ar =0 by (9), 

iff a(n Ay) Ax <—7>y=-7y by (9) and (11), 
iff “(rxAy)AcrAy=0 by (9). 


But in any Heyting algebra, the identity sz A z = 0 holds, as an imme- 
diate consequence of the definitions (1) and (8). 


54 I. Categories of Functors 


Proposition 2. In a Heyting algebra, if an element x has a com- 
plement, that complement must be 72. 


For this reason, =z is sometimes called the pseudo-complement of x. 


Proof: Suppose that x has a complement a, with « A a = 0 and 
xVa=1. By the first of these equations, a < -x. By the second and 
the distributive law, 


ae = a0 A(x Va) =-72 Aa; 


hence —r < a. Combining these results, a = 72, as asserted. 


Boolean algebras can be defined equationally, specifically by the as- 
sociative and commutative laws and the equations (7.1), (7.2), and (7.5) 
on the operations A, V, — and the elements 0 and 1. A corresponding 
result holds, in a more subtle way, for Heyting algebras; this will be used 
later to define Heyting algebra objects. 


Proposition 3. In a Heyting algebra H the implication, =>, satisfies 
the following identities, for all elements x, y, z € H: 


(x +2) =1, (12) 
rA(es>y=rary, yA(z>y)=y, (13) 
zr>(yAz=H=(«>y)Aw>2). (14) 


Conversely, in any lattice L with 0 and 1 a binary operation => satisfying 
these identities must be the implication of a Heyting algebra structure 
on the lattice L. 


Proof: Since yA x < x for all y, the definition of > x shows that 
(12) must hold. By the definition of > again, r Ay < (x => y), while, 
by evaluation, 2 A (x = y) < y; hence, the first of (13) holds. By the 
definition of => once more, y < (z = y), which gives the second of (13), 
while (14) is just (4), the fact that c > — preserves products. 

The equations (12), (13), and (14) represent familiar properties of 
implication. Also (14) with z = z, when combined with (x > z) = 1 
from (12), gives 

(c>(yAz))=r>y. (15) 


Since (14) states that the operation (x = —) preserves products, it also 
shows that (x = —) preserves inequalities, so that 


a<b implies (>a) <(¢@=> 5). (16) 


For the converse of the proposition, we must show for any lattice that a 
binary operation => satisfying equations (12), (13), and (14) necessarily 
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satisfies the definition of an exponential; in other words, that (12), (13), 
and (14) imply 


z<(e@=>y) ifandonlyif zAr<y (17) 


for all x, y, and z. Given the left-hand inequality, the first part of (13) 
yields 

zAg<(r>y)Ar=rAysy, 
which is the right-hand inequality of (17). Conversely, given that in- 
equality, one has 


z=zA(r>z)<u>z, by (13), 
z< [x= (zAz)), by (15), 
[ce > (zAx)]|<(x=>y), by assumption and (16), 
z<(c@>y), by transitivity. 
This completes the proof of (17). 


Proposition 4. A Heyting algebra is Boolean if and only if-7z = x 
for all x € H, or, if and only if x V -x = 1 for all x. 


Proof: Since the complement is unique in a Boolean algebra, x is 
the complement of =x, so the equation =r = x holds there. Conversely, 
in any Heyting algebra, by (8) and (7), 


“(x Vy) = (2 Vy) > O= (@ > 0) A(y = 0) = (-2) A (-7y). 


This is one of the DeMorgan laws (7.6). Now if also —>x = = for all z, 
one has by this law 


= 0 
=. 


Since always x A ~x = 0, this shows that =z is a complement of 2, and 
hence that H is indeed Boolean. 


The analogous characterization of Boolean algebras by x Vz = 1 is 
immediate. The assertion x Vz = 1 is the famous “tertium non datur” 
of classical logic, doubted by intuitionists and constructivists. 

As already observed, negation is a functor «=: H > H°? and also 
H°P —, H. Since x < -y iff y < 72, this functor is adjoint to itself. 
Thus, a Heyting algebra is Boolean iff this adjunction is an equivalence 
(actually, an isomorphism). 

The relation between Heyting algebras and our typical categories 
discussed in §1-6 lies in the fact that the poset of subobjects of a given 
object in any such typical category is always a Heyting algebra (and 
sometimes a Boolean algebra). Most cases are taken care of by the 
following proposition. 
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Proposition 5. Consider the functor category C = Sets©” of a 
given small category C. For any object P of C, the partially ordered 
set Subg(P) of subobjects of P is a Heyting algebra. 


Proof: Under pointwise operations, the set Sub(P) of all subfunc- 
tors of P is a complete lattice, satisfying the infinite distributive law. 
Hence (as for open sets above), it is a Heyting algebra. We list the 
explicit description of the operations A, V, 0, 1, =, and —, and leave 
further verification to the reader. If S and T are two given subfunctors 
of P, then their least upper bound SVT and their greatest lower bound 
S AT may be defined pointwise, as the functors 


(SVT)(C) = S(C)UT(C), 
(SAT)(C) = S(C)NT(C), 


since S(C) and T(C) are both subsets of P(C). The implication S > T 
is defined for C' in C by 


(S > T)(C) = {a € P(C) | for all f: DC 
in C: ifa-f € S(D) then x- f € T(D)} (18) 


(The pointwise definition doesn’t work, because it doesn’t give a sub- 
functor.) The largest and smallest subfunctors of P are respectively the 
functor P itself, and the empty functor 0 [with 0(C’) = @ for all C1. 
Consequently, negation can be described explicitly for a subfunctor S as 


(AS)(C) = {x € P(C) | for all f: D> Cin, 


ef ¢ S(D)}. 

From the description of negation, it is clear that the identity SVS = 
P need not hold in general (e.g., take C = 2). 

In the particular case of BG = G— Sets for a group G [Example (iv) 
of §1], a subobject of a given G-set X is just a subset S of X which is 
closed under the action by G (i.e., x € S implies x -g € S for allgeG 
and x € X). If S and T are two such subsets closed under the action, 
then so are SUT, SNT, and X — S = —S. So, in this case, the Heyting 
algebra structure of Sub(X) is just the restriction of the usual structure 
on the power set of X. In particular, Subpg(X) is a Boolean algebra. 
The same reasoning applies to the category BG of continuous G-sets for 
a topological group G. 

A poset P is complete iff every subset of P has an 1.u.b. (a “sup” or 
a join) and a gb. (an “inf” or a meet); actually it suffices to require 
the existence of all lu.b.s. Thus P is complete as a poset iff P as a 
category has all limits and all colimits. A complete poset is necessarily 
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a lattice with 0 and 1. We will often have to do with complete Heyting 
algebras (cHa’s) and complete Boolean algebras (cBa’s); they are, of 
course, Heyting or Boolean algebras which are complete as posets. The 
algebra Open(X) of all open subsets U of a topological space is a cHa 
with the usual sups. However, the inf of a family {U; | i € I} of open 
sets is just the largest open set contained in all the U; and so is not 
usually the set-theoretic intersection of the sets U;. 


9. Quantifiers as Adjoints 

Our discussion of the propositional calculus has indicated that the 
basic operations A, V, and => of this calculus can all be described as 
adjoints. We turn now to the more subtle question of interpreting the 
quantifiers of the usual predicate calculus as adjoints, too. 

Consider quantifiers (Vz) and (Ar)—that is, “for all z” and “there ex- 
ists an x” —as applied to a predicate S(x,y), where x and y are elements 
of sets X and Y, respectively. If we regard S as the subset S C X x Y 
of those pairs (x,y) for which S(z, y) is true, then (Vxr)S(a, y) similarly 
denotes a related subset T ¢ Y consisting of all those y with every pair 
(c,y) € S. Writing p: X x Y — Y for the usual projection, we will 
denote this subset T, corresponding to (Vx)S(z,y), as VpS; similarly 
dpS denotes the subset corresponding to (Sr)S(z,y). Now let PY be 
the Boolean algebra of all subsets T C Y and P(X x Y) the Boolean 
algebra of all S. Then PY and P(X x Y) can be viewed as categories, 
while the functions V, and dp, since they preserve the inclusion relation 
S Cc S' between subsets, are functors 


Vp, dp: P(X x Y) > P(Y). (1) 


Theorem 1. For the projection p: X x Y — Y, the functors 4, and 
Vp are respectively left and right adjoints to the functor p*: P(Y) — 
P(X x Y) which sends each subset T C Y to its inverse image p*T 
under p. 

As usual, the inverse image is p*T = {(z,y) | y € T}; it may also 
be described as the pullback of T — Y along p. 

Proof: For subsets S C X x Y and T CY one evidently has 


pT CS ifand only if TCV,S, 
Scp*T ifandonlyif 3,5 CT. 
Since p*T C S means exactly that the set Hom(p*T, S) in the category 


P(X xY) is nonempty (with one element) and so on, these equivalences 
give precisely the asserted adjunctions. 


Much the same argument applies when the projection p is replaced 
by an arbitrary function f. 
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Theorem 2. For any function f: Z — Y between sets Z and Y the 
inverse image functor f*: PY — PZ between subsets has left and right 
adjoints, dy and Vy. 


Proof: The left adjoint 4s assigns to each S C Z its image 4S CY, 
which may be described with a “there exists” as 
4,S = {y | there exists a z with fz = y and z € S$}. 
The right adjoint Vz assigns to each S the set 


V*S = {y | for all z, if fz =y, then z € S}, 


described with a “for all’. The proof that these are adjoints, as stated, 
is immediate. The notations 4, and Vy are chosen to match the special 
case of a projection f = p, when these adjoints correspond to ordinary 
quantifiers. 


The result (left adjoint on the left) is the diagram 


Z PZ 
i alg (2) 
Y, PY. 


This construction has provided adjoints for the operation f* of pulling 
back a subobject of a set Y. 

More generally, such adjoints exist not just for subsets S of a set Z 
but for arbitrary sets B over a given set Z. Indeed, for each function 
f: Z—Y, as in Theorem 2, consider the pullback functor 


f*: Sets/Y — Sets/Z (3) 

defined for any set A over Y by 
fr(A>Y)=AxyZ—>Z. (4) 
If we identify a set A over Y with a Y-indexed family { A, | y € Y } of 
sets, as in (1.8), the pullback functor (2) corresponds to “reindexing” via 


f: it sends a Y-indexed family { A, | y € Y } to the Z-indexed family 
{ Ag(z) | ZzE ZY}. 


Theorem 3. For any function f: Z — Y between sets, the pullback 
functor f*: Sets/Y — Sets/Z has both a left and a right adjoint. 
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Proof: By the equivalences Sets/Y & Sets” and Sets/Z = Sets”, 
we may as well prove that the reindexing functor 


f": Sets’ — Sets”, f*({Ay| ye Y }) ={ Apa |z€Z} 
has both adjoints. The left adjoint is 
Lp: Sets” — Sets” , 


defined for a Z-indexed family B = { B, | z € Z} by 


(By Sy Ba (5) 


f(z)j=y 


where © denotes the coproduct (disjoint union) of the sets B,. The right 
adjoint 
Ily: Sets” — Sets” 


is defined by a cartesian product II as 


(H(B))y= [J Be. (6) 


f(@)=y 


In words, given an indexed set B = {B, | z € Z}, the indexed set 
%(B) has at index y the coproduct of all the sets B, for which f(z) = y. 
This set B, is the “fiber” over z. Dually, IfyB is the product over the 
fibers. As for the proof that Uy is left adjoint to f*, just observe that 
an indexed family of maps hy: (XB), — Ay (y € Y) is the same thing 
as an indexed family of maps B, — Af z) (z € Z). The proof that Hy 
is right adjoint to the pullback f* is similar. 


More generally, suppose C is an arbitrary category with pullbacks. 
Then for each morphism of objects f: B’ > B in C, pulling back along 
f induces a functor between the corresponding slice categories 


ft: C/B > C/B’ (7) 


(the functor, of course, depends on the particular choice of the pull- 
backs). C/B is also called the category of objects over the “base” object 
B, and a functor of the form (7) is also called a change of base functor. 
Theorem 3 is a special case of the following result: 


Theorem 4. Let C be a category with pullbacks, and let B be an 
object of C. For each f: B’ — B, the change of base functor f*: C/B- 
C/B’' has a left adjoint; moreover, if C/B is cartesian closed, each such 
f* also has a right adjoint. 
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The left adjoint is given by composition with f. 
With Ly the left adjoint and IIy the right, the result is 


B’ C/B’ 
| of . 
B, C/B. 


We prove the theorem first in the case when B = 1 is the terminal object 
of C, so that C/1 is just (isomorphic to) C, while pullback along the 
unique arrow B’ — 1 is just the functor 


-— xB’: C5C/B’, 


sending each object X to the object X x B’ — B’ over B’ (by projection). 
Take any object h: Y — B’ in C/B’. An arrow from this object Y — B’ 
to X x B’ — B’ in C/B’ is then just an arrow from Y to X in C; hence, 
a left adjoint to — x B’ is the forgetful functor LU: C/B’ — C given by 
XY — B’)=¥. 

On the other hand, an arrow from X x B’ > B’ toh: Y > B’ in 
C/B’ is just an arrow t: X x B’ > Y in C such that ht is the projection 
X x B' = B’. By exponential adjunction, these arrows t correspond 
to those arrows t/: X — Y®" for which h®’ ot’ is the composite X > 


j ’ 
13> B® , where 7 arises by exponential adjunction from the identity 
B' — B’. These arrows ?¢’ in turn correspond by pullback exactly to the 
arrows t’: X — Th, where [h is the pullback in the square 


Therefore, PA, the pullback of h®’ along j, is the desired right adjoint 
to — x B’. 

Note that, if C = Sets, this pullback [A is just the set of those 
functions on B’ to Y whose composite with h: Y — B’ is the identity 
of B’; that is, the set of cross sections of the map h. Hence, in general, 
we might call [A the object of “cross sections” of the arrow h. 

Now return to the general case of any f: B’ — B. This arrow f is 
also an object (f) in the slice category C/B; moreover, an object over 
(f) is just a commutative square 


xX ——B 


| 


Bae 
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and this square is determined by X — B’; that is, by an object in C/B’. 
This correspondence is an isomorphism of slice categories 


(C/B)/(f) = C/B’, 


and pullback along f*: C/B — C/B’ = (C/B)/(f) is reduced to the 
previous case of an object [that is, the object (f)| in the cartesian closed 
category C/B. This proves Theorem 4. 

From this theorem we can conclude that pullbacks preserve colimits, 
in the following sense. 


Proposition 5. If B and B' are objects in a complete category 
C with pullbacks such that all the categories C, C/B, and C/B’ are 
cartesian closed, then pullback along any arrow f : B’ — B preserves all 
colimits which exist in C/B. 


Proof: Since pullback is a functor with a right adjoint, by Theo- 
rem 4, it must, like all left adjoints, preserve colimits. 


When colimits are preserved, as in this case, by pullbacks, we say 
that they are stable under pullback. 

Notice, incidently, for a category C with products, that a cocone 
in the category C/B is a colimit there iff the corresponding cocone in 
C (obtained by forgetting the arrows to B) is a colimit in C. Indeed, 
the forgetful functor U: C/B — C has a right adjoint B*: C > C/B 
(product with B), hence preserves colimits. Conversely, the fact that if 
the cocone yields a colimit in C then it was already a colimit in C/B, 
follows immediately from the universal property of the colimit. 

As a consequence, note also that a map 


C ———_+ C’ 
B 
in C/B is an epi there iff C > C" is an epi in C. Indeed, the square 


C ——>+ Cc’ 


i » 


CaO 


consists of maps over B, and is a pushout in C/B iff it is a pushout in 
C. But (9) is a pushout in either category iff C — C’ is an epi there. 


62 


I. Categories of Functors 


Exercises 


1. 


Show that pullbacks of epis are epi for categories of each of the 
types (i)—(ix). 


. Prove that FinSets has no subobject classifier. 
. For R aring, prove that the category R— Mod of left R-modules 


has no subobject classifier. 


. If A — B is an equivalence of categories, prove that a subobject 


classifier for A yields one for B, and that A cartesian closed 
implies B cartesian closed. 

(a) In BM = Sets” °° for M a monoid observe that an object 
X is a right action X x M — X of M on a set X and 
that, Y being another object, Hom(X,Y) is the set of 
equivariant maps e: X — Y [maps with e(xm) = (er)m 
for all zx € X,m € M]. Prove that the exponent Y* is the 
set Hom(M x X,Y) of equivariant maps e: M x X —Y, 
where M is the set M with right action by M, with the 
action e +> ek of k € M on e defined by (ek)(g,z) = 
e(kg, x). 
For objects X, Y in Sets?” for G a group, show that the 
exponent Y* can be described as the set of all functions 
f:X — Y, with the right action of g € G on such a 
function defined by (fg)x = [f(xg~!)]g for z € X. 


(b 


— 


. Let G be a topological group and BG the category of continuous 


G-sets. Let G® be the same group G with the discrete topology. 
So BG® = Sets?” is a category as considered in the previous 
exercise. Let ig: BG - BG* be the inclusion functor. 


(a) Prove that a G-set (X,u: X x G — X) is in the image of 
ig, ie., that u is continuous, iff for each x € X its isotropy 
subgroup 


I,={qgeG|x-g=z} 


is an open subgroup of G. 

Prove that, for a G°-set (X, 1) as above, the set rg(X) = 
{xz € X | I, is open} is closed under the action by G, 
and that re defines a functor BG® — BG which is right 
adjoint to the inclusion functor ig. 

Observe that ig preserves products, and conclude from (b) 
that BG is cartesian closed since BG® is. (Hint: define the 
exponential Y* in BG by re(icg(Y)'e™))] 


(b 


— 


(c 


— 
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7. In Exercise 6, show that the forgetful functor U: BG — Sets 
need not preserve infinite limits. 

8. Consider a small category C. For each object B of C there 
is a functor Dg: C/B — C defined by taking the domain of 
each arrow to B. Hence, each T: C°P — Sets yields Tg = 
T o D°P: (C/B)°P — Sets. Define an exponential T° by 


S = — 
T (B) = Hom @75) (Se. Ta), 


with the evident evaluations eg: T5(B) x $(B) — T(B). Show 
that T° with this evaluation e is indeed the exponential in the 
functor category C = Sets®” 
9. Let Q be the (linearly) ordered set of all rational numbers con- 

sidered as a category, while R* is the set of reals with a symbol 
oo adjoined. In Sets®, prove that the subobject classifier 9 has 
O(g)={r|rEeRt,r>gq}. 

10. Generalize Theorem 2 of Section 9 to presheaf categories. More 
precisely, prove that for a morphism (i.e., a natural transforma- 
tion) f: Z— Y in C= Sets©”, the pullback functor 


f*: Subg(Y) — Subs (Z) 


has both a left adjoint 3, and a right adjoint V;. [Hint: the left 
adjoint can be constructed by taking the pointwise image. De- 
fine the right adjoint Vs on a subfunctor S of Z by Vs(S)(C) = 
{y © Y(C) | for all u: DC in C and z € Z(D), z € S(D) 
whenever fp(z) = yu}.| 

11. Prove Proposition 5.1, that every functor P to sets is repre- 
sentable, by constructing for each P: C°P — Sets a coequalizer 


6 
LT yo LT vo—~p, 
= CEC 
peP(O) peEP(C) 


where |] denotes the coproduct and for each object B the maps 
are defined for each v: B — C or C’ as follows 


€B(C,p; v) = P(v)p, 9B (U,D; v) = (C,p; uv) TB(U; D; v) iz (C", pu; v). 


(Hint: For each B, this gives a split coequalizer, as defined in 
[CWM, p. 146].) 


II 
Sheaves of Sets 


This chapter starts with the notion of a sheaf F on a topological 
space X. Such a sheaf is a way of describing a class of functions on X— 
especially classes of “good” functions, such as the functions on (parts 
of) X which are continuous or which are differentiable. The description 
tells the way in which a function f defined on an open subset U of X 
can be restricted to functions f|y on open subsets V C U and then 
can be recovered by piecing together (collating) the restrictions to the 
open subsets V; of a covering of U. This restriction-collation description 
applies not just to functions, but also to other mathematical structures 
defined “locally” on a space X. 

Alternatively, a sheaf F on X can be described as a rule which 
assigns to each point x of the space a set F, consisting of the “germs” 
at x of the functions to be considered, as defined in neighborhoods of 
the point 2. The sets F, for all x can then be “pasted” together by a 
suitable topology so as to form a space (or bundle) projected onto X; 
an individual “good” function (for this sheaf) is then a “cross section” 
of the projection of this bundle. Viewed in this way, the sheaf F is a set 
F, which “varies” (with the point x) over the space X. 

The letter F is often used for a sheaf because in French the word for 
“sheaf” is “faisceau”. 

We will show that the category Sh(X) of all sheaves of sets on a 
given space X has all the properties listed in Chapter I for our “typical” 
categories (i.e., for topoi). Much of the subsequent development of the 
properties of topoi from the axioms is motivated by geometrical con- 
siderations from this case of sheaf theory. This chapter is intended to 
develop some of the sheaf-theoretic intuition behind this development. 

Readers familiar with sheaf theory might wish to skip this chapter; 
they should then note that we emphasize sheaves of sets, and not just 
those of abelian groups or of modules, and that a sheaf is defined here to 
be a suitable contravariant functor on open sets, and not the associated 
(étale) space of the sheaf, as described in §5 below. 
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A topology on a set X serves to define the continuous functions there; 
for example, the continuous functions from the space X to the reals R, 
or from any open set U in X to R. The continuity of each f: U —- R 
can be determined “locally”. This means two things: 


(i) If f: U — R is continuous and V C U is open, then the function 
f restricted to V is continuous, fly: V — R. 

(ii) If U is covered by open sets U;, and the functions f;: U; —- R 
are continuous for all 1 € I, then there is at most one continuous 
f: U — R with restrictions f|v, = f; for all 4; moreover, such 
an f exists if and only if the various given f; “match” on all the 
overlaps U;M U;, in the sense that fix = fjx for all c ¢ U;,NU; 
and alli, 7 in [. 


Property (ii) states that continuous functions are uniquely “collat- 
able”. 

Many other structures on a space X are “determined locally” in 
much the same sense. These properties (i) and (ii) can be conveniently 
expressed in terms of the function C’ which assigns to each open U Cc X 
the set of all real-valued continuous functions on U, 


C(U) = CU ={f | f: U —R continuous }. (1) 


For V Cc U, the operation of (i) restricting each f to the subset V, 
written as f > fly, is a function CU > CV, while if W CVC U are 
three nested open sets, restriction is transitive, in that (f|v)|w = flw- 
These two statements mean that the assignments 


UCU, {V CU} {CU SCV by fH fly} 


define a functor C’: O(X)°? — Sets. Here O(X) is the category with 
objects all open subsets U of X and arrows V — U the inclusions V c U. 
The statement that C’ is such a functor expresses property (i) above. 

As for property (ii) for an open covering U = (JU;, an I-indexed 
family of functions f;:; U; ~ R, i € I, is an element of the prod- 
uct set []; CU;, while the assignments {f;} +> { filu.nu, } and {fi} > 
{ f;lu.nv, } define two maps p and q of I-indexed sets to (I x I)-indexed 
sets, as in the diagram 


cu --£-» ][ cu, == [] cu; nu). (2) 
i iJ 


Then property (ii) above states that the map e given by f & {flu,} 
is the equalizer of the maps p and q (i-e., is the universal map e with 
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pe = qe). A sheaf will be defined below to be a functor C' such that (2) 
is an equalizer for all coverings U = (J Uj. 

We have described this particular C’' as a sheaf of sets; it is actually 
a sheaf of algebras over the field R or a sheaf of R-modules, because 
each set CU is an algebra over R. under pointwise sum, product, and 
scalar multiple, while the maps p, q and e of (2) are R-linear morphisms 
of rings. Hence, in this case, the statement that (2) is an equalizer is 
equivalent to the statement that the sequence of R-modules 


0——+cu —*> |[ cu; 7% [[cw:nu,) (3) 
i 449 


is left exact (i.e., that e is the kernel of p— q). 

There are many other examples of sheaves on a space X ; for example, 
the functor D with each D(U) the set of all functions, continuous or not, 
on U to R, or I(U), the set of all continuous functions on U to the unit 
interval I in R. However, the set B(U) of all bounded functions on U to 
R is a functor of U but not a sheaf, because the collation of functions 
which are bounded may yield an unbounded function. 

For the Euclidean n-space X = R” there are a number of examples 
of sheaves. For U open in R” let C*®U be the set of all f: U — R which 
have continuous partial derivatives of all orders up to order k inclusive. 
Then C* is a functor C*: O(X)°P — Sets with values in Sets or in 
R-Mod, and (2) 

with C replaced by C* is again an equalizer because differentiability 
is local. Thus, each C¥ is a sheaf on R”. This leads to a nested sequence 
of subsheaves on R”: 


Ch ersCC" COT eC arCereC Sc. 


We will regard a sheaf as a functor, that is, as a special kind of a 
presheaf. Here a presheaf of sets P on a topological space X is defined 
to be a functor P: O(X)°P — Sets; that is, a presheaf on X is the same 
thing as a presheaf on the category O(X), as defined in §I1.1. This means 
that each inclusion V C U of open sets in X determines a function 


P(V CU): PU = PV, (4) 
which we will often write for each t € PU as t+ tly, just as if it were 


restriction of an actual function t. Moreover, (¢|v)|w = tlw whenever 
WcecVcuU. 


Definition. A sheaf of sets F on a topological space X is a functor 
F, O(X)°? — Sets such that each open covering U = J; Ui, i € I, of 
an open set U of X yields an equalizer diagram 


FU --£-5 [I Fu. ser Il F(U; U3), (5) 
a ) 
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where for t € FU, e(t) = { t|y, |i €¢ I} and for a family t; € FU;, 


P{t.} = {til,rv,)}, ti} = {tlw.nu,)}- 


A morphism F — G of sheaves is a natural transformation of func- 
tors. Sh(X) will denote the category of all sheaves F of sets on X, with 
these morphisms as arrows; so, by definition, Sh(X) is a full subcate- 
gory of the functor category O(X) = SetsO*%)"”, A separated presheaf 
is a functor F, as above, such that the map e in (5) is injective, (ie., 
a monic in Sets), though not necessarily the equalizer of p and g. For 
example, the functor B, with each B(U) the bounded real-valued con- 
tinuous functions on U, is a separated presheaf but not a sheaf. 

Since an arrow into a product is determined by its components (its 
composites with the projections of the product), the maps e, p, and q of 
the diagram (2) are the unique maps which make the diagrams below 


FU; LEU OU CU) F(U;NU;,) 

FU ---£--+ [[ Fu: era [[Fu: NU,) (6) 
i a5 
FU; Fonieu,y PU: NU;) 


commute for all i, 7 € I, where the vertical maps are the (various) pro- 
jections of the products in question. This categorical description of the 
equalizer diagram means that our definition applies with Sets replaced 
by other suitable categories, and so defines sheaves F': O(X)°P — C of 
C-objects on a space X, where C is any category with all small products. 
The classically useful cases are sheaves of abelian groups, of rings, and 
of R-modules and R-algebras, for various rings R. Sheaves of modules 
are important as coefficients for the cohomology of a space. 

Note that the definition of a sheaf implies that every sheaf F must 
send the empty set @ onto a one-point set {*}. For, in any space X the 
empty open set § has an empty cover (J = ); since a product J], over 
an empty index set I is the one-point set {«}, the equalizer (5) becomes 
F(Q) + {*} {+}, so F(0) = {*}, as asserted. 

A subsheaf of a sheaf F on X is defined to be a subfunctor of F 
which is itself a sheaf. The local character of a sheaf is exhibited by the 
following description of a subsheaf: 


Proposition 1. [f F is a sheaf on X, then a subfunctor S C F 
is a subsheaf if and only if, for every open set U and every element 
f € FU, and every open covering U = JU,, one has f € SU if and only 
if fly, € SU; for alli. 
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Proof: The stated condition is clearly necessary for S to be a sheaf. 
Conversely, consider the commutative diagram 


SU ——— [[ su: SS [1s NU;) 


I || 


FU [[ 7 == [[F@nv,), 


with vertical maps monic and bottom row an equalizer. The last con- 
dition of the proposition states precisely that the left-hand square is a 
pullback. It follows by a diagram chase that the top row is an equalizer. 

If f: X + Y is a continuous map of spaces, then each sheaf F on X 
yields a sheaf f, F on Y defined, for V open in Y, by (f, F)V = F(f7!V); 
that is, f,F is defined as the composite functor 


ory) £7, ocx)? ©. Sets. 


This sheaf f, F is called the direct image of F under f. The map f, so 
defined is clearly a functor 


fr Sh(X) — Sh(Y). 


Also (fg)« = fxg+, So the definition Sh(f) = f, makes Sh a functor on 
the category of all small topological spaces. In particular, if f: X + Y 
is a homeomorphism, f, gives an isomorphism of categories between 
sheaves on X and sheaves on Y. 

Let U be an open set in the space X. Any sheaf F' on X, restricted to 
open subsets of U, is clearly a sheaf F|y on U. In this way, U + Sh(U) 
and U D> V } (Fly +} Fly) define a contravariant functor on O(X). In 
fact, since the notion of a sheaf is “local”, this functor is itself almost a 
sheaf: 


Theorem 2. If X =|) W,, is an open covering of the space X, and 
if, for each k, Fy is a sheaf of sets on W;, such that 
Frlw.nw. = Felcwenwe) (7) 


for all indices k and £, then there exists a sheaf F on X, unique up 
to isomorphism, with isomorphisms F'|w, = F), for all indices k, which 
match on the equation (7). 


Proof: Write Fi,¢ for the sheaf (7) 
on Wi. Ws. If the desired sheaf F exists, then for each open U one 
must have an equalizer 


Fu —>|[ R(UNW:,) => |] Fe Wen We). (8) 
k ke 
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Take this as the definition of each set FU. If U > V, a comparison 
with the equalizer definition of FV gives a unique map FU —> FV, 
and, with these maps as the restrictions, F is a functor on O(X)°?, 
so is a presheaf. To prove it a sheaf, consider any covering U; of U 
and construct the commutative 3 x 3 diagram with first column FU > 
[1 £Ui 3 [] F(UinU;) and with rows the definitions, like (8), of FU and 
FU;,. Then all these rows and (F;, being a sheaf) the last two columns are 
equalizers. A simple diagram chase resembling that for the 3 x 3 lemma 
(see [Mac Lane, 1963, Lemma XII.3.3]) then proves that the left-hand 
column is an equalizer, so that F is indeed a sheaf. Uniqueness up to 
isomorphism is evident from the similar uniqueness of the equalizer (8). 

Many explicit sheaves can be constructed from the local pieces Fy, 
according to the instructions contained in the proof of this theorem. A 
more liberal version of Theorem 2 is stated in Exercise 8. 

Another method of constructing sheaves on a space X is given by 
Theorem 3 below. Let B Cc O(X) be a basis for the topology on X. So 
for any point z € X and any open set U containing z, there is a basic 
open set B € B with g € B CU. Moreover, we shall assume that B is 
closed under finite intersections (but this is not strictly necessary, cf. the 
Appendix, §4). B can be viewed as a full subcategory of O(X), so it 
makes sense to speak of presheaves on B, i.e., functors F': B°P — Sets. 
Such a functor F is called a sheaf on B if for any basic open set B € B 
and any open cover B = L),.; B; of B by basic open sets B; € B, the 
diagram 

F(B)>—> |] F(8) =3 [| F698), (9) 
i ins 
analogous to (5), is an equalizer diagram. A morphism of sheaves on B 
is a natural transformation; so one obtains a category Sh(B) of sheaves 
on B. Clearly, any sheaf F': O(X)°? — Sets on X restricts to a sheaf 
on B, and this process defines a functor r: Sh(X) — Sh(B). 


Theorem 3. For a basis B of the topology on a space X, the re- 
striction functor r: Sh(X) — Sh(B) is an equivalence of categories. 


More informally, this theorem says that a sheaf F on X, or a map 
7: F -+ G between sheaves on X, may equivalently be defined by spec- 
ifying the values F'(U), or the components vy, only for basic open sets 
U. This theorem is a special case of a result in the Appendix. We leave 
a direct proof to the reader as Exercise 4. 
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On any space X, each open set U determines a presheaf Hom( — , U) 
defined, for each open set V, by 
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1 f#V CU 


H V,U)= 
om) e otherwise, 


(1) 
where 1 is the one-point set. This presheaf is clearly a sheaf; it is the 
representable presheaf y(U) = Hom(—,U) on the category O(X). Re- 
call from 81.4 that a sieve S on U in this category is defined to be a 
subfunctor of Hom(—,U). Replacing the sieve S by the set (call it S 
again) of all those V Cc U with SV = 1, we may also describe a sieve 
on U as a subset S C O(U) of objects such that Vo C V € S implies 
Vo € S. Each indexed family { V; C U | i € I} of subsets of U generates 
(= “spans”) a sieve S on U; namely, the set S consisting of all those 
open V with V c V; for some ?; in particular, each Vo C U determines 
a principal sieve (Vo) on U, consisting of all V with V Cc Vo. It is not 
difficult to see that a sieve S on U is principal iff the subfunctor S of 
y(U) is a subsheaf (Exercise 1). A sieve S on U is said to be a covering 
sieve for U when U is the union of all the open sets V in S. 

In the definition of a sheaf, we may replace open coverings by cov- 
ering sieves, as follows: 


Proposition 1. A presheaf P on X is a sheaf if and only if, for 
every open set U of X and every covering sieve S on U, the inclusion 
ig: S — yU of functors induces an isomorphism, 


Hom(yU, P) = Hom(S, P). (2) 
(Here each Hom is the set of natural transformations. ) 


Proof: For any presheaf P on the space X and any covering of an 
open set U by U;, we can construct the equalizer EF in the diagram 


£E—4.][Pu = 3 [[ Pung). 


i,j 


Specifically, E consists of those families of elements 1; € PU; with 
zilu,nu, = £j|u,nu, for all pairs of indices (7, 7). Now replace the cover- 
ing U; by the corresponding sieve $, consisting of all open sets V with 
V c U; for some i, and for each V define xy to be zi|y. By the as- 
sumption that the 2; match on intersections U,M U;, the zy so defined 
are independent of the choice of the index i with V Cc U;. Therefore, 
the equalizer EF can be described as the set of those families of elements 
zy € PV for V € S with ey|y: = xy whenever V’ C V. Now regard 
S as a functor O(X)°? — Sets with SV = 1 for those V € S and 
SV = @ otherwise. Each element xy € PV is then a map SV — PV, so 
the equalizer E is now described as the set of natural transformations 
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6: S + P [where @y(1) is zy]. Next use the inclusion ig: S — yU to 
form the diagram 


Hom($, P) —4— [] Pu; =3 [] PU: 95) 
a 4,9 


(és)* | (3) 


Hom(yU, P) —— PU, 


where y is the isomorphism given by the Yoneda lemma, while the maps 
e, p, and qg are described as before in (1.2), and the equalizer d is the 
function which sends each natural transformation 0: S — P to the fam- 
ily @y,(1) € PU, of its values, for i € I. 

For this diagram, one verifies that the square in the middle always 
commutes, so that e does, in fact, always factor through the equalizer d 
of p and q. Therefore, P is a sheaf (i.e., e is the equalizer) if and only if, 
for every covering U;, the left-hand vertical map (ig)* is an isomorphism, 
where S is the corresponding covering sieve. 


This proposition has the theoretical advantage of describing sheaves 
wholly in terms of objects (presheaves and sieves) of the category of 
presheaves. It also slightly simplifies some proofs of facts about sheaves. 
Moreover, it will be used as a definition of sheaves in terms of a more 
general notion of covering (Chapter III). 

As said before, the category Sh(X) for the space X is a full subcat- 
egory of the functor category (the category of presheaves) Sets°™”; 


Sh(X) > O(X) = Sets?” (4) 


We will soon see (§5) that this inclusion functor has a left adjoint. This 
will imply the second part of 


Proposition 2. For any space X the category Sh(X) has all small 
limits, and the inclusion of sheaves in presheaves preserves all these 
limits. 

A direct proof is easy. First consider equalizers. Given two maps 
F' 3G of sheaves, take their equalizer E + F 3 G as presheaves. Now 
for any presheaf P the hom-functor Hom(P, — ) preserves limits (and in 


particular, equalizers). Hence for any covering sieve S on U, the rows 
of the commutative diagram 


Hom(yU, E) ———> Hom(yU, F) ——> Hom(yU, G) 


| | | 


Hom(S, £) -——— Hom(S, F) —=——> Hom(S, G) 
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are equalizers in Sets. The vertical maps are those induced by the 
inclusion ig: S — yU and the diagram commutes (for two squares on the 
right, the square with both upper horizontal arrows or both lower). Since 
F and G are sheaves, the two right-hand vertical maps are isomorphisms. 
Our diagram is now like a map of two left exact sequences of modules, 
and a simple diagram chase, like that used to prove the well-known “five 
lemma” of homological algebra, shows that the left-hand vertical map is 
an isomorphism. Therefore, E is a sheaf; since E — F is the equalizer 
in presheaves, it is immediate to verify that it is also the equalizer in 
sheaves. 

This argument has used the second description of sheaves by condi- 
tion (2); the proof can be given in terms of the original covering descrip- 
tion by means of a bigger (3 x 3) diagram. Similar arguments produce all 
other small limits in Sh; indeed it is enough [CWM, p. 109] to construct 
all equalizers (as above) and all small products; in particular, an argu- 
ment like that above shows that the pointwise product of two sheaves 
is a sheaf and that the terminal presheaf 1 is a sheaf. In each case, 
given a diagram in Sh(X), we regard it as a diagram in SetsO% ye. we 
take the limiting object in Sets°(* We show that it is actually a sheaf, 
and conclude that this sheaf, with its limiting cone, is also the limit in 
Sh(X). In the terminology of [C'WM, p. 108], this amounts to showing 
that the inclusion (4) of sheaves in presheaves creates limits. A similar 
result for a more general case will appear in Proposition III.4.4. 

Recall that a subsheaf of a sheaf F’ was defined in §1 to be a sub- 
functor of F which is itself a sheaf. 


Corollary 3. A subobject of a sheaf F in the category Sh(X) is 
isomorphic to a subsheaf of F. 


Proof: Let the given subobject be represented by an arbitrary 
monic m: H>— F in Sh(X). Now in general, m is monic iff m fits into 
a pullback square 


in Sh(X) (cf. §1.2). Hence, by Proposition 2, this square is a pullback in 


the category O(X) of presheaves on X. But pullbacks in this category 
are computed pointwise, so m is a pointwise monic map. Hence, each 
set H(U) is isomorphic to a subset $(U) of F(U), so H is isomorphic to 
the subfunctor S of F (and this subfunctor is necessarily a sheaf since 
HH is). 
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Next, observe that open sets are just the subsheaves of 1. 


Proposition 4. For any space X, there is an isomorphism 
O(X) = Subsh(x) (1) (5) 


of partially ordered sets (in fact, of Heyting algebras); here 1 is the 
constant sheaf 1 = Hom(—, X) as in (1) above. 


Proof: Given any open set W of X, define a functor Sy on open 
sets U by Sw(U) =1if U C W and Sw (UV) = 0 otherwise. This functor 
is clearly a sheaf, so it defines a subsheaf of 1. Conversely, let S be 
a subsheaf of 1 (by the corollary, any S — 1 can be thus represented). 
Each S(U) is then either 1 or 9. Since S is a functor, S(U) = 1 for some 
U and V CU imply S(V) = 1. And by the equalizer condition (5) of 
§1, if {U; | i € I} is an open cover of U and SU; = 1 for all i, then 
SU =1. Thus, if we let W = U{U € O(X) | SU = 1}, then for all 
open sets U, SU = 1 iff U C W. That is, S = Sw. Thus, W » Sy 
is the desired bijection O(X) & Sub(1). It is clearly order-preserving, 
hence an isomorphism of partially ordered sets. 


This result shows that the partially ordered set of the open subsets 
of a topological space X can be recovered from the category Sh(X)—as 
the set of all subobjects of the terminal object 1 of Sh(X). In this sense, 
the category of sheaves of sets on a space X determines the topology 
of X. 


3. Sheaves and Manifolds 


The purpose of this section is to give some examples of sheaves arising 
in the context of manifolds. The specific properties of manifolds and the 
related sheaves will not be used at other places in this book, and the 
reader who wishes to do so may skip this section. 

The invariant description of manifolds may be suggested by the case 
of the 2-sphere S?. The sphere is initially described as the subset x? + 
y? + 2? = 1 in R®, but it can also be described intrinsically, without 
reference to the ambient space R°?. Omitting the north pole n € S?, the 
stereographic projection is a homeomorphism ¢: S? — {n} — R?, and 
similarly w: S? — {s} — R? for s, the south pole. We get all of S? by 
taking these two homeomorphic copies S? — {s} and S?— {n} of R” and 
pasting them together along the common part $*— {s,n}. Moreover, we 
can test a function on S? for continuity, differentiability, etc., by testing 
it separately on each of these two parts. In this way, using Theorem 1.2, 
we get the sheaf of continuous functions on $*, the sheaf of differentiable 
functions on S?, and hence the various smooth structures on this sphere. 
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This exemplifies the way in which manifolds and functions on them can 
be constructed by pasting together Euclidean pieces. 

A topological n-manifold M is a space which is locally like R”; this 
suggests that manifolds involve sheaves; they do. Specifically, an n- 
dimensional manifold M (see [Dold, Chapter VIII]) is a second count- 
able Hausdorff space such that each point g € M has an open neighbor- 
hood V homeomorphic to an open set W C R”. Such a homeomorphism 
@: V >W CR’ iscalled a chart for M; moreover, a function is contin- 
uous on V C M when its composite with ¢~! is continous on W C R”; 
in this way, the chart determines the sheaf Cy of continuous functions 
on V as the direct image Cy = (¢~1),Cw. In particular, the n coordi- 
nate projections R” — R, restricted to W and composed with ¢, yield 
n coordinate functions 7,...,0,: V — R called the local coordinates 
for the chart ¢. Conversely, these n functions determine the chart, as 
that continuous map V — R” which has the components 7;: V > R. 
(This map to R” is then restricted to its image W in R”.) 

An atlas for M is an indexed set {6;: Vi; + W;} of charts such that 
the domains V; cover M. Any such atlas determines M as a topological 
space. Two charts ¢; and ¢; of an atlas may “overlap” on the set V;NV;, 
as in the diagram 


W; Yio OV; Wj 
6-®=H 


The chart ¢; gives by composition with inclusion V; NV; Cc Vi > 
W; a homeomorphism ¢;;: V; 1 V; & Wi; from the overlap to some 
open set W;; C W; C R”, and @; gives a (different) homeomorphism 
31: Vi NV; = Wy; to a different open set Wj; C W; of R”. Thus, 
for each ordered pair of indices (i, 7) we have a composite “transition” 
function 


b3ibi3 1:Wi3 Wi, =, VET (1) 
mapping one open set of R” into another one, homeomorphically. 


Pictorially, Mf may be obtained by taking all the open sets W; of R” 
and pasting the Wij C Wi to W3i C Wj together by the “transition” 
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functions (1). Formally, consider the diagram 


Lvov Tu, 0) 


iJ a 


where |]; designates the coproduct (disjoint union) in Top, y sends 
each point x € V; to the same x € M, while a (or @) sends each point 
riz in V; NV; to the same zi in V; (or Vj). It is immediate to prove 
that M is the coequalizer of a and ( in this diagram in the category 
Top of topological spaces. The parallel to the definition of sheaves is 
immediate. 

Smooth manifolds are treated similarly. Take “smooth” to mean C* 
(ie., with k continuous derivatives) for some k, possibly k = oo. Then 
for each open set W in R” there is the set C*(W) of all smooth functions 
W — R and hence the sheaf C&, of all smooth functions to R. defined 
on open subsets of W. For two open sets W and W’ in R” a function 
WV: W > W’ is said to be smooth iff every composite goV with a smooth 
real-valued function g: W’ — R is smooth; it is sufficient to require 
that all the composites x; o V with one of the coordinate functions 2;, 
i=1,...,n are smooth. For the corresponding sheaves this means that 
every f: Wi — R smooth on an open subset W 4 of W’ has a composite 
fo(¥|w,) smooth on Wo = Y~!W4. In other words, composition with 
gives a map Ck,, + &,(CK,) of sheaves on W’. A smooth n-dimensional 
manifold is now defined to be a topological n-manifold M with an atlas 
such that all the transition functions bbz" of (1) are smooth. This 
definition is not “invariant” because there are many choices of an atlas 
for M; however, each smooth atlas can be enlarged to a maximal such 
smooth atlas, and this maximal atlas is invariant. 

The homeomorphism ¢; serves to transfer “smooth” on W; C R” to 
“smooth” on V; C M; this gives the sheaf C* of all smooth functions 
on (open subsets of ) V;. The required smoothness conditions on the 
transition functions insure that the sheaves C* and Ck agree when re- 
stricted to the overlap V; V;. Therefore, just as in Theorem 1.2, the 
sheaves C} can be collated to give the sheaf C* of all smooth functions 
on (open subsets of) M. It is a subsheaf of the sheaf C of continuous 
functions, moreover, its restriction to each V, is the sheaf CF. Thus, 
each smooth manifold carries a structure sheaf, its sheaf C* of smooth 
functions. Like C* on R”, it is not just a sheaf of sets, but a sheaf of 
R-algebras (in particular, a sheaf of rings). 

A smooth n-manifold M can now be redefined as a second countable 
Hausdorff space together with a subsheaf S = Sy, of the sheaf Cy of 
continuous functions on M with the property that each point p € M 
has an open neighborhood V and n functions 71,...,2, € SV such that 
the map ¢: V — R” with components the x; is a homeomorphism to 
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an open set W Cc R”, and such that this homeomorphism carries the 
sheaf C* in W isomorphically onto Sly [i.e., so that f is smooth on V, 
meaning f € S(V), if and only if f¢~! is smooth on W]. This sheaf- 
theoretic definition of a manifold has the advantage that it is invariant 
and that it exhibits directly the smooth structure in terms of all the 
smooth functions (on all open subsets). 

If M and N are smooth manifolds, a smooth map h: M > N isa 
continuous map such that f smooth on an open subset V of N implies 
fh smooth on h-!V Cc M. In other words, the map Cy — h,(Cy,) 
of sheaves on N given by composition with h sends the subsheaf Sx 
into hy (Sy). We write Homco.(M,N) for the set of all such smooth 
h: M —» N; with these hom-sets, the set of all smooth manifolds is 
a category. Any open subset of a smooth manifold is again a smooth 
manifold, and the coequalizer diagram (2) can be reinterpreted as a 
coequalizer diagram in the category of all smooth manifolds. For fixed 
M and N, the assignment 


U ++ Homex(U, N), U open in M, 


is a presheaf on M which is actually a sheaf because the smoothness of 
h: U — N can be tested on the individual sets of any open covering of 
U. It is called the sheaf (of germs) of smooth maps of M to N. 

Many other basic constructions used in the geometry of manifolds 
lead to sheaves. An example is the tangent bundle for a C™ manifold. 
It may be constructed from the tangent vectors to “paths” in M. At 
each point g € M we consider simultaneously the smooth functions 
f: V — R defined in some open neighborhood of g and the smooth 
paths h: R — V in that neighborhood which pass through q, with 
h(0) =q. Then fh: R — Ris smooth, so has at 0 € Ra first derivative 
d( fh)/dt|,.,. Consider the resulting pairing 


d( fh) 

(fag = a in (3) 
and define the equivalences f = f’ at q to mean that (f,h), = (f’,h)q 
for all h, and h =h’ at qg to mean that (f,h)g = (f,h’)q for all f. The 
resulting equivalence classes of functions f form a real vector space T% 
(under addition and scalar multiples of functions). An element of this 
space T? = T7M is called a cotangent vector at the point g; in particular, 
each function f determines such a vector dgf. The equivalence classes 
of paths h are called tangent vectors 7 at q, so that each smooth path h 
through g has a tangent vector at g, say ™. By the pairing (3) above, 
each tangent vector 7 determines a linear map 


L,:T% +R, L,, (dqf) = (fi h)g. 
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In this way, the set T, of all tangent vectors at q is isomorphic to the 
set of all linear maps T? — R; that is, to the dual of the vector space 
T1. As the (linear) dual of a vector space, the tangent space T, is thus 
itself a vector space. By (3) and the Leibniz rule for the derivative of a 
product, the map L = L, satisfies a corresponding product rule 


L( fg) = (fala + (gq) Lf (4) 


for all f, g € SU. A tangent vector at g can be defined to be a suitable 
linear map L with this property; we have not followed the custom of 
using this definition because we prefer the above treatment in which 
tangent and cotangent vectors enter in an even-handed way. 

In local coordinates 7,,...,¢n, (and with the usual local coordinate 
t on R) the familiar formula for the derivative of a composite function 
expresses the pairing (3) as 


(ihe = (SE) (GS), 


where the partial derivatives are evaluated at q and at t = 0. Each 
cotangent vector dgf at q thus has n (real) coordinates (Of/Ox;)q. This 
gives n coordinate functions, written 0/02,,...,0/02n, for every cotan- 
gent space T? with q € V, and n dual coordinates dz,,..., dx, for every 
tangent space T,; both spaces are n-dimensional and the formula above 
is essentially the usual one for the differential df. 

The picture is completed by putting all the tangent spaces together in 
one bundle; thus the tangent bundle TM is the disjoint union ][, T,M in 
the category of sets of all the tangent spaces, together with the projection 
p: TM — M sending each tangent vector to the point g at which that 
vector is defined. The tangent bundle as a set is locally a product 
because over the open base V of a smooth chart ¢: V + W Cc R”, 
the function ¢ induces a bijection ¢,: pp-!'V = W x R”. If 2,...,2n 
are local coordinates for V, one then has (1)p,...,@np,d21,...,d%n) as 
2n local coordinates on p-1V. Take the open sets on TM necessary 
to make all such local coordinates continuous; these charts make TM 
& topological 2n-manifold. If “smooth” means C’°, then choosing the 
¢1 as the smooth charts makes TM a C™®-manifold and p: TM > M 
smooth. Moreover, each “fiber” p~'{g} of the smooth projection map p 
is a vector space TM (i.e., the tangent space at q). With this structure, 
p: TM — M is a vector bundle in the sense defined in §4 below. 

With this notion of tangent bundles, one has a direct definition of 
“vector fields”. A vector field X on an open set U of the smooth manifold 
M is a smooth map X: U ~ TM such that the composite pX is the 
identity (more exactly, is the inclusion U C M). Such a map X is also 
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called a cross section over U of the bundle p: TM — M. This definition 
agrees with the intuitive idea that a vector field attaches to each point 
q € M a tangent vector X, at q, in such a way that this vector “varies 
smoothly” with g. The study of vector fields is a notable branch of 
topology, as witness the famous theorem that there is no nonvanishing 
continuous vector field on S* and the famous calculation of the exact 
number of linearly independent vector fields on S” for all n [Adams, 
1962]. The function sending each U to the set FU of all vector fields on 
U is a sheaf on M, the sheaf of vector fields on M. It is a subsheaf of 
the sheaf of all smooth maps M > TM. 

The cotangent bundle T* M is another smooth manifold, constructed 
in the corresponding way from the cotangent spaces T7M. A differential 
1-form w on an open set U in M is again defined as a cross section: A 
smooth map w: U + T*M such that the composite U + T*M + M 
is the identity. These form a sheaf of differential 1-forms on M. Each 
smooth f: M —+ R determines a particular 1-form df, defined as that 
function which sends each q to the congruence class d,f (as defined 
above) of f in T7M. If in this construction each cotangent space T7M 
is replaced by its p*® exterior power, one obtains the bundle and the 
sheaf of differential p-forms. Similarly, tensor products of T?7M and 
T,M yield sheaves of tensors on M. 

Other types of manifolds may be treated similarly, the most impor- 
tant being the case of complex analytic manifolds. Let C be the field 
of complex numbers. If U is open in C”, a function h: U > C is holo- 
morphic on U when it has a convergent power series expansion in some 
neighborhood of each point of U. Thus, h holomorphic on U and V CU 
implies that h|y is holomorphic on V. Moreover the notion “holomor- 
phic” is local: h is holomorphic on U if and only if it is holomorphic on 
every set U; of some open covering of U. These properties define a sheaf 
Hon C” 


HU ={h:U +C|h holomorphic }. 


It is clearly a sheaf of rings (and of C-algebras). 

Starting from this, one defines a complex analytic n-manifold M: A 
real 2n-manifold with an atlas ¢;: U; ~ W; Cc C” for i € I for which 
all the transition functions are holomorphic. Each such manifold carries 
then as structure sheaf the sheaf of holomorphic functions. This case 
differs from that of smooth manifolds chiefly because two holomorphic 
functions h, k: D — C ona connected open set D of the complex plane 
equal on some open subset of D are equal on all of D. 

Both smooth and complex analytic manifolds are examples of ringed 
spaces. A ringed space X is a topological space equipped with a fixed 
sheaf R of rings, called the structure sheaf. A morphism f: (X,R) — 
(X’, R’) of ringed spaces is a continuous map f: X — X’ together with 
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a homomorphism a: R’ -+ f,(R) of sheaves of rings (see §7). (In the 
examples above, @ was given by composition with f.) These spaces 
with these morphisms form a category, useful in treating “manifolds 
with singularities”. In this way, additional structure on a space is often 
presented in terms of a “structure” sheaf on that space. 


4. Bundles 


For any space X, a continuous map p: Y — X is called a space over 
X or a bundle over X. Put differently, these bundles are the objects of 
the slice category Top/X, while an arrow f: p— p’ of this category is 
a continuous map f: Y > Y’ with p’f =p. A cross-section of a bundle 
p: Y — X is a continuous map s: X — Y with ps = 1; that is, it is an 
arrow from the identity X — X to Y — X in the category Top/X. For 
each « € X, the inverse image p~!2 is called the fiber of Y over x. It 
is convenient to think of a bundle as the indexed family of fibers p'z, 
one for each point x € X, “glued together” by the topology of Y. 

If U is an open subset of the base space X of a bundle p: Y — X, 
then p restricts to a map py: p-'!U —» U which is a bundle over U; 
moreover, the square diagram 


p 'U——+Y 


a 
8 ue 
Pu see Pp 
os 


U ———+ X 


with horizontal arrows the inclusions, is a pullback diagram in Top. A 
cross-section s of the bundle py, also called a cross-section of the bundle 
p over U, is a continuous map s: U - Y such that the composite ps is 
the inclusion 1: U — X. Let 


T,U ={s|s:U >Y and ps=i:U CX} 


denote the set of all such cross-sections over U. If V C U, one has a 
restriction operation ,U — T,V, so P',( — ) defines a functor O(X)°? + 
Sets. Also, one may test “locally” whether or not a given function s on 
U is a cross-section. Hence, I, is a sheaf of sets on X, called the sheaf 
of cross-sections of the bundle p (one often writes TY for T',). In this 
way, every bundle over X leads to a sheaf on X. Also, each map p — p’ 
of bundles over X induces a map Ip — Ty of sheaves on X, so T is a 
functor from bundles to sheaves. In the next section, we will see that 
every sheaf on X can be so regarded as a sheaf of cross-sections of some 
bundle. In the present section we will simply exhibit some examples of 
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bundles and their sheaves of cross-sections: sheaves over discrete spaces, 
vector bundles, principal bundles, associated bundles, and étale bundles. 

First, a discrete example: Let F' be a sheaf on a space X which is 
discrete (every subset of X is open). Then each one-point set {x} is 
open, so F' determines a function f: X — Sets by fa = F({z}). Any 
open subset U is covered by the sets {x} for x € U, so 


ru —— [J F({z})) = 1 
x2eU 


is an equalizer, and therefore FU = ||, ¢y fx. The space 


Y= [| f¢>x 


rex 


with the discrete topology, and the projection p sending each w € fz to 
xz € X, is then a (discrete) bundle over X with fibers fr. Moreover, FU 
is the set of cross-sections of p over U C X. Any function f: X —> Sets 
determines a discrete bundle over X in this way, and a sheaf of cross- 
sections. [One obtains an equivalence of categories Sh(X) = Sets/X, 
which is nothing but the familiar equivalence from §1.1(9).| 

If X is any topological space and L is any real vector space, regarded 
as a topological space, and if X x L has the usual product topology, the 
projection X x L -—» X is a bundle over X, called a product vector 
bundle. A cross-section is just a continuous map X — L. A (real) 
vector bundle Y over X is defined to be a bundle p: Y — X which is 
“locally” a product bundle of vector spaces in the sense that 


(i) For each x € X, the fiber p~1z is a real vector space. 

(ii) Each point z € X has an open neighborhood V for which there 
is a real vector space L and an isomorphism ¢ in Top/V, linear 
on each fiber, as in 


Y > ptv—2+vxL 


‘a 


XD V=>— Vp. 


For a smooth manifold M, the tangent, cotangent, differential form, 
and tensor bundles are all real vector bundles in this sense, with isomor- 
phisms (1) given by local coordinates. 

In a vector bundle Y — X, the vector space operations of addition 
and scalar multiple are continuous. For the scalar multiple, this simply 
means that the map RxY -—> Y given by (r,y) — ry, the scalar multiple 
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by r € R in the fiber, is continuous. For the operation of addition, more 
must be said because the sum y, + ye is defined only for two points y) 
and y2 on the same fiber. To say this, construct the pullback Y xx Y 
of p over p in Top 


Y xx ¥Y ———+ Y ¥x¥v¥ —_y 
a p p (2) 
Y —,—X, xX ———- X. 


Its points are precisely the pairs (41, y2) with py, = py; that is, the 
pairs which can be summed. It is a space over X, as indicated by 
the dotted arrow, while vector addition is, as displayed at the right, a 
function +: Y x xY — Y of spaces over X. Because of the isomorphisms 
(1), it follows that this function + is continuous, i.e., defines an arrow 
in the category Top/X. 

Let G be a topological group. A (continuous) right action of such a 
group G on a space Y (a G-space) is a continuous map a: Y x G—> Y 
with the usual (associative and unit) properties. Two points y, y’ in Y 
are equivalent in Y (under the action of G) when y’ = yg for some g € G; 
the quotient space Y/G is the set of equivalence classes (or orbits) of 
points of Y, with the quotient topology. The function p sending each y 
to its orbit gives a bundle p: Y ~ Y/G = B. Let Y xg Y denote the 
pullback of p over p. Now Y x G has two maps to Y, by projection and 
by the given action. These two maps combined as 0(y,9) = (y,yg) 
yield a continuous map @ to a pullback, as in 


¥y 6 2.) 4 Y = V 


yb 


Y —,—B. 


Then p is called a principal G-bundle (and the action of G on Y is called 
8 principal action) when 6 is a homeomorphism Y x GY Y xp Y. The 
condition that 6 is a bijection can be split into two: First, yg = yh 
for 9, h € G implies g = h; we say that G acts freely. Second, for 
& pair of points y, y’ in the same fiber of Y there exists an element 
9 © G with yg = y’; we say that G acts transitively on the fibers; then 
each fiber is homeomorphic to G. For example, the Stiefel manifold is 
& principal O;-bundle for the orthogonal group O;, with base space the 
Grassmann manifold. For further examples we refer to [Husemoller] 
and [Steenrod]. 

From a principal bundle with fiber G one can construct another 
bundle with fiber any given G-space F’, as follows. Let b: Gx F > F be 
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a continuous left action of the group G on a space F’. Given a principal 
G-bundle p: Y — B as above, form in Top the coequalizer e as in the 
diagram 08 
YxGx FI—43Y x F——-+ Y @coF 
n| p (4) 
Y= 

Thus e identifies in Y x F all pairs (yg, f) = (y,gf) foryeU, 9g €G, 
f € F; its codomain is written Y @g F to suggest the analogy with a 
tensor product Y @pr F of (right and left) modules over a ring R. Here 
py is the projection of Y x F on its first factor Y, so that the composite 
pp; also coequalizes a x 1 and 1 x b in the diagram. Therefore, there 
exists a unique continuous map p’: Y ®g F — B which make the square 
in (4) commute, and the fiber of the bundle p’ is G@g F = F. This 
bundle p’ is called the bundle with fiber F' associated to the principal 
bundle p. We refrain from developing further properties. = 

Another type of bundle is a “covering”. A covering map p: X + X 
is a continuous map between topological spaces such that each x € X has 
an open neighborhood U, with z € U c X, for which p~'U is a disjoint 
union of open sets U;, each of which is mapped homeomorphically onto 
U by p. Thus a covering map is a local homeomorphism in a strong 
sense. For example the 1-sphere S! (circle) has, for each n > 0, an n- 
fold covering by itself, where p: S! — S$? is e?™? 2rin® so winds the 
first circle n times around the second one; the real line covers the circle 
by p: R-> S!, t 4 e?*"t; in this case any open connected U ¢ S!, not all 
of S$’, has p~'U the union of denumerably many copies of U. Similarly, 
the plane R. x R covers the torus S’ x $1. A covering X is said to be 
universal if X is simply connected. A connected space X has a universal 
covering X — X provided X is locally simply connected in the large (to 
each point x € X there is a neighborhood U such that any closed path 
in U contracts in X to a point). All other connected coverings X can 
be obtained as quotients of the universal one; see [Massey]. 

The consideration of holomorphic functions H of one complex vari- 
able also leads to a bundle and to the notion of a Riemann surface. A 
function h holomorphic in an open set U of the complex plane C deter- 
mines at each point a € U a series h(z) = S7° 9 cn(z — a)”, convergent 
in some circular disc |z — a| < r about a; moreover this power series 
in turn determines the whole function h. Now form a space R whose 
points are the pairs (a, >c,(z — a)") consisting of a point a € C and 
a power series (with complex coefficients c,) converging in some non- 
trivial circular disc about a. Let Il: R + C map this pair to a € C. 
Topologize R by taking the following basis of open sets N: To each point 
(a,h = >> c,(z—a)”) of R and to each disc |z — a| < r about @ in which 


ree 
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the power series converges, let the corresponding neighborhood N con- 
sist of the points (b, }> c/,(z— b)") for b in the disc and )> c/,(z —b)” the 
power series expansion of the function h at z = b, i.e., the same function 
h, expanded at 6. Then If: R — C is continuous, and a (continuous) 
cross-section k of II over an open set U Cc C assigns to each b € U 
a power series (and hence a holomorphic function) converging in some 
circle about 6. Moreover, the topology on R insures that these power 
series “fit” together at nearby points; hence the above cross-section k is 
exactly a function holomorphic on the domain U. In other words, holo- 
morphic functions are exactly continuous cross-sections k, and k can be 
described as the “analytic continuation” of the power series h above. 
Actually, we should enlarge the complex plane C to the Riemann sphere 
by adding the usual one point at oo, and enlarge R correspondingly to 
Rt, adding over infinity the power series > c,/z” convergent for |z| > r, 
for some r. Then the Riemann surface of a holomorphic function k, as 
above, is just the (largest) path-connected component of R* containing 
the set k(U). Then R*+ — {C,0o} is a space which puts together all 
Riemann surfaces. As we will see in the next sections, the consideration 
of this space over {C, co} is equivalent to the considerations of the sheaf 
of holomorphic functions on {C, 00}. This is the sense in which analytic 
continuation and Riemann surfaces provide an early example of sheaves. 
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The proof that every sheaf is a sheaf of cross-sections of a suitable 
bundle depends on the idea of a “germ” of a function. Two holomorphic 
functions h, k: U — C are said to have the same “germ” at a point 
a € U if their power series expansions around a are the same; this 
implies that h and k agree on some neighborhood of a. In other cases, 
convergent power series expansions may not exist, but one may still 
say that two continuous (real-valued, say) functions f and g have the 
Same “germ” at a point x if they agree in some open neighborhood of z; 
thus germ, f = germ,g implies fr = gz, but not necessarily conversely. 
More generally, consider any presheaf P: O(X)°P — Sets on a space 
X, a point x, two open neighborhoods U and V of x, and two elements 
8 € PU, t € PV. We say that s and ¢ have the same germ at x when 
there is some open set W C UNV with 2 € W and slw = tlw € PW. 
This relation “has the same germ at x” is an equivalence relation, and 
the equivalence class of any one such s is called the germ of s at 2, in 
symbols germ, s. Let 


P, = {germ,s | s € PU,x €U open in X} (1) 


be the set of all germs at z. Then, letting P®) be the restriction of the 
functor P: O(X)°? —+ Sets to open neighborhoods of x, the functions 
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germ,: PU — P, form a cone on P(®) as on the right of the figure below 
[because germ,s = germ,(s|y) whenever c € W C U and s € PUI. 
Also, if {ry: PU + L}zcy on the left below is any other cone over P®), 
the definition of “same germ” implies that there is a unique function 
t: P, > L, 


PU 
tu/ py \BrMs 
I <-~-----5------- P, 


with to germ, = 7. This just states in detail that the set P, of germs 
is the colimit and germ, the colimiting cone of the functor P restricted 
to open neighborhoods of z: 


P, = lim PU. (2) 
—_ 
zeU 


This statement summarizes the definition of “germ”. The set P, of all 
germs at x is usually called the stalk of P at x. Moreover, any morphism 
h: P — Q of presheaves (any natural transformation of functors) induces 
at each point  € X a unique function hz: P, — Qz such that the 
diagram 


Py —_, gu 
germ, | [em - (3) 
Py 2 5-Q5 


commutes for any open set U with x € U. It follows that P 1» P,, 
hw hg is a functor SetsO(*)"” _, Sets, “take the germ at zx”. 

Now combine the various sets P, of germs in the disjoint union Ap 
(over z € X), 


Ap =||P, = {all germ, s |x € X,s € PU}, (4) 
z 
and define p: Ap — X as the function sending each germ,s to the point 
x where it is taken. Then each s € PU determines a function § by 


&:U - Ap, sx = germ,s, xreu; (5) 
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moreover, § is a section of p. In this way, each element s of the original 
presheaf can be replaced by an actual function $ to the set Ap of germs. 
Topologize this set Ap by taking as a base of open sets all the image sets 
3(U) C Ap; thus an open set in Ap is a union of images of the sections 
§ This topology makes both p and every function § continuous. If 
s € PU and t € PV determine sections § and ¢ which agree at some 
point c € UNV, then the definition of germ shows that the set of all 
those points y € UNV where sy = ty is an open set W C UNV, such 
that s|w = tlw. This proves that each § is continuous; it is trivially an 
open map and an injection. Hence §: U — 8(U) is a homeomorphism. 
Finally, if h: P — Q is a natural transformation between presheaves, the 
disjoint union of the functions hz: P, — Qz of (3) is a map Ap — Ag 
of bundles, readily shown continuous. Thus P + Ap is a functor from 
presheaves to bundles. 

The bundle p: Ap — X so constructed is a local homeomorphism, 
in the sense that each point of Ap has an open neighborhood which is 
mapped by p homeomorphically onto an open subset of X. Specifically, 
each point germ,s has the open neighborhood sU, and p restricted to 
5U has s: U — 8U as a two-sided inverse, hence is a homeomorphism 
to U. 

The space Ap of such a bundle is usually not Hausdorff. For example, 
let P be the sheaf of continuous real-valued functions on the real line 
R, and compare the two functions g with gx = 0 for all x and f with 
fc = x” for c > 0 and fx = 0 for x < 0. Then at the origin 0, 
germ, f # germog because f and g differ in every neighborhood of 0, 
but germ, f = germ,g if t < 0, so that every neighborhood of germyf 
must intersect every neighborhood of germjg—namely, in one of these 
points germ, f. In the special case of a sheaf H of holomorphic functions 
of a complex variable, germ, f = germ,g implies that f and g agree on 
an open set V containing x, hence on any connected open set containing 
V; from this it follows that the space Ay is Hausdorff. 

Now consider for a given presheaf P the sheaf [Ap of sections of the 
bundle Ap —+ X. For each open subset U of X, there is a function 


nu: PU -TAp(U), nu(s) = 8. 


The process of restricting s to an open subset of U clearly matches the 
7’s, so 7 is a natural transformation of functors 


qn: P—-»ToAp. (6) 
Theorem 1. [f the presheaf P is a sheaf, then 7 is an isomorphism, 


P=TAp. 
In other words, every sheaf is a sheaf of cross-sections. 


86 Il. Sheaves of Sets 


The proof uses both parts of the condition (ii) on collation of ele- 
ments in the definition (§1) of a sheaf. 
First we show that ny is an injection; that is, that 


8=# implies s=t, s,t € FU. (7) 


For § = ¢ means that germ,s = germ,t for each x € U, so there is, 
for each x, an open set V, C U with sly, = t|y,. These open sets 
V, cover U, so that the given elements s and t have the same image 
in PU — JJ, PV,. By the first part of the condition (ii) on collation, 
s=t. 

Next let h: U + Ap be any cross-section of the bundle of germs over 
some open set U. Then for each point « € U there is an open set U, 
and an element s, € PU, such that 


hz = germ, Sz, xz eEU,, Sz, € PUg. 


Now A is continuous and §,U, is by definition an open subset of the 
bundle Ap, so there must be an open set V, C U with x € Vz C U;z and 
hV, C $,U,; that is, with h = s, on Vz. Thus, we have a covering of 
the open set U by open sets V, and an element sz|y, in each PV,. On 
each pairwise intersection V,M Vy, the functions , and s, agree with 
h and hence with each other. This means that germ,s, = germ,sy for 
z in V, 1 Vy, so that szly,nv, = Sylvenv, by (7) above. The family of 
elements s, thus has the same image under both of the standard maps 
I] PVes]] P(N V,). Therefore, by the second part of the collation 
conditions (ii) there exists an s in PU with sly, = s,. Then at each zx, 
A(z) = germ,sz = germ,s, so h = §; the arbitrary cross-section is thus 
in the image of 7. This proves that 7 is an isomorphism. 


Theorem 2. For any presheaf P, the corresponding morphism 
n: P —> TAp of presheaves in (6) is universal from P to sheaves. 


This means that if F is a sheaf and 6: P > F is any map of 
presheaves, there is a unique map o: TAp — F of sheaves such that 
con=8: 


P—"_.TAp 
ig (8) 


! 
F. 


@ 


Proof: Since 7: F —» [Ap is an isomorphism by Theorem 1, one 
may define a map 0: TAp — F of sheaves as o = 7 'T'Ag, so that in 
the following diagram (9), 
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p—7—.TAp 


| a [> (9) 


the bottom triangle commutes. Since 7 is natural, the outer square 
of (9) also commutes; so 0 o7 = 6 as required since 7: F + T'Ap is 
an isomorphism. It remains to prove that o is unique. We state this 
separately as a lemma. 


Lemma 3. Let P be a presheaf on X, and let o, 7: TAp — F be 
two maps into a sheaf F on X. Ifoon=rT0on: P+ F, theno =r. 


Proof: Consider an open set U and a section h € TAp(U). Ifz € U, 
then there exist a neighborhood V, of x and an element s, € P(V,) 
such that h(x) = germ,(s,); and, as in the proof of Theorem 1, we can 
choose V, so small that hly, = 5, = nv, (sz). Thus, o(h)|v, = o(hlv,) = 
on(sz) = TH(Sz) = T(hlv,) = T(h)|v,. Since these sets V, (for x € U) 
form a cover of U and F is a sheaf, it follows that o(h) = r(h). But h 
was an arbitrary section of Ap, so we proved that o =r. 


Notice that in the proof of Lemma 3, we have used only that F is a 
separated presheaf, not necessarily a sheaf. 


Corollary 4. For any topological space X, the category Sh(X) of 


sheaves of sets on X is reflective in the category O(X) of presheaves on 
X, 


Proof: Here reflective means that the inclusion functor 
Sh(X) > Sets?” = O(X) 


has a left adjoint. But Theorem 2 asserts precisely that the composition 
To A is such a left adjoint, with the universal map 7 as the unit of this 
adjunction. 


The left adjoint functor 
TA: Sets°™”” _, Sh(X) 


is known as the associated sheaf functor, or the sheafification functor. It 
carries each presheaf P on X to the “best approximation” [Ap of P by 
a sheaf. 

This associated sheaf functor is left exact, in the sense that it pre- 
Serves all finite limits. Indeed, both of the functors [ and A used to 
define it are left exact (Exercise 6). 
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6. Sheaves as Etale Spaces 


The construction of sheaves as sheaves of cross-sections of a bundle 
suggests that a sheaf F on X can effectively be replaced by the corre- 
sponding bundle p: AF — X. We have seen that this bundle is always 
a local homeomorphism; conversely, any such bundle (called an “étale” 
bundle) can be reinterpreted as a sheaf. 

A bundle p: E — X is said to be étale (or étale over X) when p 
is a local homeomorphism. in the following sense: To each e € EF there 
is an open set V, with e € V C E, such that pV is open in X and 
plv is a homeomorphism V — pV. Thus the vicinity of the fiber over 
each x € X in case X = R? can be pictured as a serving of shishkebab: 
Through each point of the fiber there is a horizontal open disc on which 
the projection p is a homeomorphism to a disc in R?. The discs at 
different points of the fiber (pieces of lamb or onion, say) may come in 
very different sizes. All these servings over different points x € X are 
“glued together” by the topology of E. In particular, a covering space 
X — X is étale, and in this case all the discs on any one fiber can be 
taken of the same size. As we will see, there are many étale maps which 
are not coverings. On the other hand, a projection X x R -» X of a 
product with the reals can never be an étale map, because no product 
neighborhood is projected homeomorphically into X. For much the same 
reason, a nontrivial vector bundle is evidently never étale. 

If p: EF -» X is étale and U C X is open, the pullback Ey — U as 
in 


Ey ——— E 


rb 


U ——+X 


is also étale over U. A section of E will always mean a section s of 
Ev for some open U; that is, a continuous s: U — E such that the 
composite ps is the inclusion U Cc X. 

Contemplation of the definition of “étale” yields the proof of 


Proposition 1. For p: E — X étale, both p and any sections of 
p are open maps (carry open sets to open sets). Through every point 
e € E there is at least one section s: U + E, and the images sU of all 
sections form a base for the topology of E. If s and t are two sections, 
the set W = {x | sx = tz} of points where they are both defined and 
agree is open in X. 


Now bundles over X and presheaves on X may be compared as 
follows. 
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Theorem 2. For any space X there is a pair of adjoint functors 
Top/X =Bund X —— Sets”. (2) 


here [ assigns to each bundle p: Y — X the sheaf of all cross-sections 
of Y, while its left adjoint A assigns to each presheaf P the bundle of 
germs of P. There are natura] transformations 


np: P TAP, ey: ATY > Y, (3) 


for P a presheaf and Y a bundle which are unit and counit making A in 
(2) a left adjoint for T’. If P is a sheaf, np is an isomorphism, while if Y 
is étale, ey is an isomorphism. 


Proof: The natural transformation np has already been constructed 
in §5, in the inevitable way, sending each s € PU to the corresponding 
cross-section §: U + Ap. 

For ey we also use the inevitable construction. Given a bundle Y > 
X, each point of the corresponding étale bundle ATY has the form sx 
for some point x € X and some actual cross-section s: U — Y of the 
given bundle. The “inevitable” definition of € is 


ey ($x) = sx € Y, reu, séETy. (4) 


This definition is independent of the choice of s, for if some other section 
t: V + Y has the same germ, $x = fa at 2, then s = t on some 
neighborhood of x, so sx = tx. Similarly, one verifies the continuity of 
ey: ALY — Y; it is therefore a map of bundles. It is also natural in Y. 

The space AIY is in general much bigger than Y; for example, a 
vector bundle over X is by no means étale over X (two cross-sections of 
Y may intersect at just one point). However, when Y is étale over X, ey 
is an isomorphism, as we may prove by constructing an inverse Oy. To 
each point y of Y with py — x there is an open neighborhood U of z in 
X and a cross-section s: U —» Y passing through y, so with sx = y. We 
define Oy y = sx € ATY, and verify readily that this is independent of 
the choice of the cross-section s and that 6y: Y —» ATY is continuous. 
It is manifestly a 2-sided inverse for ey of (4). 

Next we observe that 7 and ¢€ have the property that both composites 


Tr Te A € 
Psa ty aa Sn (5) 


are identities. For example, given a bundle Y over X and a cross-section 
s €TyU, the first composite in (5) sends s first to s € TATyU and 
thence, via «, back to s. Similarly the second composite is germ,s > 
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germ, +> sx = germ,s, for x € X and s a cross-section over some open 
neighborhood of zx. 

From these two identities on 7 and «¢ it follows formally that T is 
left adjoint to A; indeed, these are the two “triangular identities” for an 
adjunction, which show that 7 and ¢ are respectively the unit and the 
counit of an adjunction. This completes the proof of the theorem. 

Let Etale X be the full subcategory of the category Bund X = 
Top/X consisting of étale bundles. 


Corollary 3. The functorsT and A from Theorem 2 restrict to an 
equivalence of categories 


Sh(X) === Etale X. (6) 


Moreover, Sh(X) is a reflective subcategory of SetsO(*)"" (as already 
asserted in Corollary 5.4), and Etale X is a coreflective subcategory of 


Bund X. 


The second part of the corollary states that the inclusion functor 
Sh(X) + Sets?” has a left adjoint, and that the inclusion functor 
Etale X — Bund X has a right adjoint. 

The proof of the corollary from Theorem 2 is an easy purely formal 
exercise, which may be formulated as the following condition that an 
adjunction restricts to an equivalence of suitable subcategories. 


Lemma 4. Consider an adjunction (left adjoint on the left) 
A: P= BT. (7) 
which satisfies the following two conditions on unit n and counit e: 
B € B implies that nrg: TB + TAB is an isomorphism, (8) 
P €P implies that exgp: ATAP — AP is an isomorphism. (9) 


Let Po be the full subcategory of P with objects P all those isomorphic 
to some ['B, while Bo is that full subcategory of B with objects those 
B isomorphic to some AP. Then A andT restrict to an equivalence of 
these subcategories, as in the top row of the diagram 


Ao: Po ——== Bp: To 


| (10) 


A: PB: T 


moreover, Pp is reflective in P and Bo is coreflective in B. 
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Proof: The hypotheses (8) and (9) show that the unit and counit, 
when restricted to Po and Bp, do satisfy the triangular identities and 
are isomorphisms, so give the asserted equivalence of Pp to By. Now let 
i: Po ~ P and j: Bo — B be the inclusions of the full subcategories, as 
in (10). Since the image of A is in Bp, it restricts to a functor A’: P + Bo 
with jA’ = A, and the original adjunction restricts to an adjunction 


A’: P= By: Tj. 


Therefore, composing this last adjunction with the equivalence in the 
top row of (10) makes [pA’ left adjoint to ['jAp, where jAg = Ai and 
TAi = i by the hypothesis (8). In other words, [pA’ is left adjoint 
to the inclusion 7: Pp — P; this means that Po is indeed a reflective 
subcategory of P (which by definition means that the inclusion Py — P 
has a left adjoint). An analogous proof shows that Bo is coreflective in 
B. 


This lemma applies in our case, first, because cross sections of any 
bundle B give a sheaf 'B, while nr is an isomorphism for any sheaf F, 
as required in (8), and, second, any presheaf P determines a bundle AP 
which is étale, while by Theorem 2 eg is an isomorphism whenever the 
bundle B is étale, as required by (9) above. 

There is a different, more conceptual, way to construct the bundle- 
presheaf adjunction of Theorem 2. Each open set U of a topological 
space X immediately gives an inclusion U — X and hence a functor 
A: OpenX — Top/X, as in the diagram 


SetsO(*)" .--B-- Top/X (UX) 
| a a 
A A 
O(X) U 


Then each bundle p: Y —» X determines a presheaf R(p), with 
R(p)(U) = Hom(A(U), p: Y + X) = Homtop/x(U — X,Y — X). 


But R(p)(U) is then exactly the set of cross sections of p over U, so 
R(p) = ['(p) is the cross-section functor. The Theorem I.5.2 then states 
that this functor has a left adjoint A’, described there as a colimit. 
Since the left adjoint is unique up to equivalence, this left adjoint A’, 
constructed here using colimits of bundles, must be isomorphic to the 
left adjoint A constructed using germs as in Theorem 2 above. 

As a consequence of the equivalence of sheaves on X and étale bun- 
dles over X, every sheaf can be viewed as a sheaf of cross-sections. 
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Indeed, for each sheaf F' consider the associated étale bundle AF — X 
over X; after replacing each set FU by the isomorphic set of sections 
of AF over U, we can assume that each element s € FU is an actual 
section s: U — AF of the bundle, and for W C U that the restriction 
map FU — FW is the actual restriction of a section. In the notation 
above, this amounts to identifying each s with 3, so that sx = germ,s 
for every x in the domain of s. 


Corollary 5. Given sheaves F and G over X, a morphism h: F —+ 
G of sheaves may be described in any one of the following three equiva- 
lent ways: 


(a) as a natural transformation h: F — G of functors; 

(b) as a continuous map h: AF — AG of spaces (bundles) over X; 

(c) asa family hz: F, —> G, of functions, on the respective fibers over 
each x € X, such that, for each open set U and each s € FU, the 
function x +> hz(sx) is continuous U > AG. 


Condition (c) can be reformulated as follows: As a map h: AF — AG 
of sets over the set X such that each composite ho s is continuous on U. 


Proof: We already have the equivalence of the descriptions (a) and 
(b). Given the hy: FU — GU natural in U, determine the functions 
h,: Fy — Gz as in the diagram (5.3). Since sr = germ,s for s € FU 
and x € U, the diagram (5.3), with P, Q there replaced by F, G, yields 


(hy s)x = germ, (hy s) = hg(germ,s) = hz(sx); 


therefore, for « € U, the function z +> hzsz is hys € GU, hence is 
continuous. Conversely, the family of hz satisfying these continuity con- 
ditions determine hy and, by disjoint union over the fibers, the map 
h: AF —+ AG of bundles. 

This last point of view, of sheaf maps h: F —» G in terms of stalk 


maps h,: F, — Gz, is convenient, for example, when one describes epis 
and monos in the category of sheaves. 


Proposition 6. A map h: F — G of sheaves on a space X is an 
epimorphism (respectively, a monomorphism) in the category Sh(X) iff 
for each point x € X the map of stalks hz: F, — Gz is a surjection 
(respectively, an injection) of sets. 


One possible proof of this proposition makes use of a construction, 
related to the stalk-functor at a given point x, 


Stalk,: Sh(X) — Sets, 
Fu F, =lim F(U), 
—. 
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where in the colimit U ranges over all open neighborhoods of the point 
zg € X. Consider for each set A the “skyscraper sheaf” concentrated at 2, 
denoted by Sky,(A). This sheaf is defined, as a functor O(.X)°P — Sets, 
by 

A if2euw, 
1 otherwise, 


Sky,(A)(U) = { (11) 


where 1 denotes some fixed one-element set. For an inclusion U D V of 
open sets, the restriction map Sky, (A)(U) —-> Sky,(A)(V) is the evident 
one: the identity A — A if « € V (and hence z € U), and the unique 
map into 1 ifz ¢ V. 


Lemma 7. For each point « € X, the stalk functor 
Stalk,: Sh(X) — Sets is left adjoint to the sky-scraper sheaf functor 
Sky, : Sets — Sh(X). 


Proof: Given a set A and a sheaf F, the bijective correspondence 
between set functions ¢: F, — A and sheaf maps h: F — Sky,(A) is 
described as follows. From a function ¢: F, —» A define the sheaf map 
h: F > Sky,(A) by components 


hy: F(U) — Sky,(A)(U); 


if « ¢ U then there is one such function F(U) — Sky,(U) = 1, and if 
x € U then for any element s € F(U), define hy(s) = d(germ,(s)) € 
A = Sky,(A)(U). Conversely, a sheaf map h: F — Sky,(A) induces 
aset-map ¢: F; — A: elements of F, are of the form germ,(s) where 
s € F(U) for some open neighborhood of z, and the function ¢: F, ~ A 
sends this element germ,(s) to hy(s) € A = Sky,(A)(U). One readily 
verifies that these constructions, of h from ¢ and conversely, are well- 
defined, mutually inverse, and natural in F and A. 


Proof of Proposition 6: (<) Suppose h: F — G is a map of 
sheaves such that each stalk map is a surjection. To see that A is epi, 
consider sheaf maps k and @: G — H such that kh = fh. Then at each 
point « € X, the stalk maps have kz oh, = (koh), = (Coh)z = C2 ohg, 
hence k, = é, since hz is surjective. But k and @ are determined by 
their effect on stalks, by Corollary 4. Hence, k = @. In exactly the same 
way, it follows from Corollary 4 that k is mono if each stalk map hg is 
injective. 

(=>) For the converse, consider first the case of epis. If h: F > G is 
an epimorphism in the sheaf category, then each stalk map hz: F; > Gz 
is an epi in sets; i.e., a surjection. Indeed, the stalk functor has a right 
adjoint by the lemma, hence preserves colimits. But then it must also 
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preserve epis, since in general a map hi is epi iff the corresponding square 


eo +6 
| 
e e 


is a pushout. 
Now consider the case of monos. If h is a mono in the sheaf category, 
then 


F 


I 


is a pullback. Since pullbacks of sheaves are computed “pointwise” (see 
Proposition 2.2), it follows for each open set U C X that the square 


F 


FU) =, GU) 
is a pullback of sets. Therefore, hy is injective. Now suppose germ,(s) 
and germ,(t) are two elements of F, represented by s € F(U) and 
t € F(V), such that h,(germ,(s)) = hz(germ,(t)). By definition of hz, 
this means that 


germ, (hy (s)) = germ,(hy(t)). 


Hence, for some smaller neighborhood W C UNV, we have hu(s)|w = 
hv (t)|w. Thus, by naturality of h, 


hw(slw) = hu(s)|w = hv (t)|lw = hw(tlw). 


Since hy is injective, it follows that s|w = tlw; hence germ,(s) = 
germ,(t). Thus, h,: F; — G, is an injective function. 


Since the category of sheaves is equivalent to that of étale spaces, 
many authors define a sheaf of sets on X to be aspace E — X étale in 
X (for example [Swan]). For these authors, what we call the sheaf F of 
continuous functions on (open sets of ) X becomes the sheaf of germs of 
continuous functions, and similarly for other sheaves, such as the sheaf 
of germs of differential forms—and every sheaf is described as a sheaf 
of germs of ---. We have chosen to define sheaves as (special) functors 


7, Sheaves with Algebraic Structure 95 


because we will also consider non-topological cases where the étale space 
is not available and sheaves consequently must be defined as functors. 


7. Sheaves with Algebraic Structure 


For homology and cohomology groups in algebraic topology and al- 
gebraic geometry, the “coefficient” groups must often be taken locally; 
that is, should be sheaves of groups. This is one of the basic reasons for 
considering sheaves. We have already seen examples of sheaves of groups 
and of rings; we now show how such sheaves can be defined systemat- 
ically, by diagrams, starting with sheaves of sets. An abelian group is 
a set A with a binary operation (addition), a unary operation (additive 
inverse), and a nullary operation (zero, considered as a function defined 
on A° = 1= {x}) as in 


(ry)maty, rr—z, *0, 


a v U 1 
AxA-A, A-~A, 1—- A, ) 


and satisfying certain identities: Associative and commutative laws for 
a, v a left inverse for a, and u a left zero. Each of these identities can 
be written as a commuting diagram involving the arrows a, v and u of 
(1) [(CWM, pp. 2-5]. In any category C with finite products (and hence 
with a terminal object 1), we can thus define an abelian group object of 
C to be an object A together with the three arrows (1), for which the 
indicated diagrams all commute. The abelian group objects in C form 
the objects of a category Ab(C), with arrows f: A — A’ those arrows 
of C which commute with the three “structure maps” a, v, and u of (1). 
An abelian group object in Sets is then exactly an (ordinary) abelian 
group. This observation about abelian groups applies also to groups, to 
rings, to modules, etc., in C; that is, to any algebraic structure defined 
by one or more n-ary operations satisfying specified identities. 

These diagrammatic formulations will apply to define sheaves of 
abelian groups. A presheaf P of abelian groups on a space X is de- 
fined to be an abelian group object in Sets°(*)””. Since the product 
P x P in any functor category Sets©”” is the product in Sets, taken 
pointwise, the operation of addition a for such a P is a transforma- 
tion ay: PU x PU — PU, natural for U € O(X). The same holds 
for the other operations, so each set PU is an abelian group. Thus, 
a presheaf of abelian groups on X may also be described as a functor 
P: O(X)°? —» Ab, where Ab is the category of all small abelian groups 
(abelian group objects in Sets). Similarly, a sheaf of abelian groups 
is an abelian group object in Sh(X); or equivalently, it is a functor 
F: O(X)°P — Ab such that the composite with the forgetful functor 
Ab — Sets is a sheaf of sets. 
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A bundle of abelian groups over the space X is defined by the same 
diagrams; it is an abelian group object Y in the category Bund X of 
spaces over X. Now in this category the product of a bundle p: Y > X 
with itself is just the pullback of p with p in Top: 


Y xXx Y ——— Y 


| | 


CSS ss 
it is the subspace of Y x Y consisting of all those pairs ( y, y’) of points of 
Y which lie in a common fiber of p. Therefore, the operation of addition 
in the group object Y is just an addition in each fiber p-1x. A bundle 
of abelian groups can also be described as a bundle p: Y — X of spaces 
in which each fiber p~'z is an abelian group in such a way that the 
resulting operations of addition +: Y xx Y —» Y and inverse Y - Y 
are continuous maps (of bundles over X), as in (4.2) for vector bundles. 

Note also that the product of two bundles, both étale over X, is 
itself étale over X, because the pullback of two local homeomorphisms 
is a local homeomorphism (see Lemma 1 of §9 below). 

For categories C and D, any functor A: C + D which preserves finite 
products will carry any abelian group object in C to another such in D; 
it thus induces a functor AbC — AbD. Now, since the product bundle 
Y xx Y’ is a pullback in Top, a cross-section of the product bundle 
is just a pair of cross-sections, one of Y and one of Y'. Therefore, the 
cross-section functor !: Bund X — Sets?\*)”” preserves products. So 
does its left adjoint A: Sets?” —, Bund _X; indeed, each fiber for 
AP at x € X is given by a colimit (5.2) taken over the subcategory of 
O(X)°P which consists of all open U containing z. Now given sets U 
and V in this subcategory, their intersection UM V is also there, with 
arrows U + UNV and V — UNV. This means that the subcategory is 
“filtered”, and filtered colimits are known [CWM,, pp. 211-212] to com- 
mute with finite limits, in particular with binary products; this shows 
that A preserves products (see also Exercise 6). For these reasons the 
adjointness Theorem 6.2 and its corollaries apply also to abelian group 
objects, giving a pair of adjoint functors 


Ab(Bund) aoe Ab(Sets?™””), 


and similarly for bundles of rings or of real or complex vector spaces. 
In particular, the inclusion Ab(Sh(X)) — Ab(Sets™””) is reflec- 
tive (has a left adjoint, called “sheafification” for presheaves of abelian 
groups). 
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Over each sheaf of rings there will be sheaves of modules. Specifically, 
let R be a sheaf of rings on a space X (i.e, X is a ringed space with 
structure sheaf R). Thus, R is a ring object in Sh(X), so there can be 
left R-module objects in Sh X; they are the sheaves A of left R-modules. 
In other words, they are sheaves A of sets, such that each A(U) is an 
R(U)-module and such that each restriction V C U induces a morphism 
of the R(U)-module A(U) to the R(V)-module A(V). 

We leave the reader to formulate the description of the corresponding 
étale spaces. 


8. Sheaves are Typical 


For any space X, we now observe that the category Sh(X) of sheaves 
of sets on X is a topos, i.e., it has all the properties discussed for our 
“typical” categories in Chapter I. We have already seen in Proposi- 
tion 2.2 that Sh(X) has all finite limits; and that the product of two 
sheaves is in fact their product as presheaves. This suggests that the 
exponential for presheaves [i.e., for functors on O(X)°?| can also be used 
for sheaves; that this is so is an immediate consequence of 


Proposition 1. If F is a sheaf and P a presheaf of sets on the space 
X, then the (presheaf) exponential F? is a sheaf. 


Proof: Both F and P are functors O(X)°? — Sets, and the expo- 
nent was defined, in §1.5, to be the functor FP with 


F?(U) = Hom(y(U) x P, F) (1) 


for all open U, where Hom is the hom-set of the functor category 
Sets©(* Ta while y(U) is the representable presheaf determined by U. 
Thus y(U)(V) is 1 or empty according as V C U or not; therefore the 
natural transformations in F?(U) need really be defined only for open 
sets V CU. In other words, 


F?(U) = Hom(Plv, Flv) (2) 


where Hom refers to the category of presheaves on U, and where Ply 
and Fly are the functors P and F restricted to O(U)°P. Moreover, 
FP(U) is a functor of U in the evident way: if V C U then each natural 
transformation a: Ply — Fly restricts to aly: Ply + Flv. Thus F? is 
that presheaf whose values on each open set U are the maps Ply — Fly 
of presheaves on U. It is routine to verify that F? so defined is a sheaf: 
given a covering U = UL), U, and natural transformations 7;: Ply, > Flu, 
for all i, one can collate these 7; to r: Ply —+ Fly by collating their 
values (which are collatable, because they lie in the sheaf F ). We leave 
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the details to the reader, with the remark that a later more general 
method will allow a more conceptual proof (see Proposition III.6.1 and 
Lemma V.2.1). 

In particular, when P = G and F are sheaves, so is the (presheaf) 
exponential F°. It is often called the “internal hom”, the “sheaf-valued 
hom”, or the “sheaf of germs of morphisms G — F”, and is then written 


Hom(G, F) = F°. 


Finally, we introduce a particular presheaf 2 on X by taking NU to 
be the set of all open subsets of U, 


QU ={W|W CU, W open in X }, (3) 


for every open set U in X. It is a functor of U by intersection: If V c U, 
then W +> WV is the induced map of NU to OV. 


Theorem 2. For any topological space X, the presheaf 2 defined 
in (3) is a sheaf on X, and is a subobject classifier for Sh(X). 


Proof: To show 2 a sheaf, consider any open covering of U by open 
sets U;. Then given open sets V; C U; for all i with V;NU, =V,NU; C 
U, NU; for all i and j, there is clearly a unique open set V, namely, the 
union of the V;, with VNU, = V; for all i. This verifies the equalizer 
condition which states that 2 is a sheaf. 

Now consider a subobject S C F of a sheaf F’. We may assume 
(Corollary 2.3) that S is a subsheaf of F' so each S(D) is a subset of F(D). 
As the corresponding characteristic natural transformation ¢: F > 9, 
we propose the function 


gy: FU — QU, 


which sends each x € FU to the union W of all those open subsets W; 
of U for which z|w, € SW;. Since S' is a subsheaf, it is immediate that 
z|w € SW. It is also clear that @y so defined is natural in U. Next 
consider the pullback of true along ¢y 


P ------- >1 PU ------- >1 

r ie | 

true ( 

\ { 

F—_+9, FU ——+Qu, 
g gu 


where “true” is the map which for each U sends the point of 1 to the 
maximum element U € NU. The pullback is taken pointwise, at each 
open set U, and produces PU as the subset of FU consisting of those 
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g € FU with ¢éyx = U; in other words, the pullback P is exactly the 
given subsheaf S. Moreover, if S is the pullback of true along any other 
arrow w: & — 2 one readily shows that 7) must be ¢, so ¢ is unique, 
and 2 with the map true: 1 —- 12 is therefore a subobject classifier. 


Note the resemblance to the characteristic functions used for 
presheaves (i.e., for the functor category) in §I.4. There, given a sub- 
functor S C T, the characteristic function gave all the “paths to truth”; 
here, by the definition of ¢ above, it gives the “shortest path” W C U 
to truth. 

Finally, Sh(X) has all small colimits. Indeed, the sheafification 


L: Sets°™”” —s Sh(X) 


is left adjoint to the inclusion functor, so L must preserve colimits. More- 
over (§5), if F is a sheaf, LF ~ F. Hence, given two sheaves F and G, 
their coproduct F Li G as presheaves (their pointwise disjoint union) 
yields a sheaf L(F I G) which is their coproduct 


FY LF L(FUG)~—LG2G 


as sheaves. Coequalizers and other colimits are treated similarly. Thus 
we have shown that Sh(X) has finite limits and colimits, exponentials 
(Proposition 1) and a subobject classifier (Theorem 2) and hence is an 
elementary topos (81.6). 

The category Etale X of all spaces étale over X is equivalent (§6) 
to the category Sh(X), hence also has all the listed properties of our 
typical categories; i.e., Etale X is also an elementary topos. 


9. Inverse Image Sheaf 


A continuous map of spaces, f: X — Y, will induce functors in 
both directions, forward and backward, on the associated categories of 
sheaves. In §1 we have already observed that each sheaf F on X yields 
an induced sheaf f,F on Y, defined for each open set V on Y by 


(f-F)(V) = F(f-'V). 


This f,F was called the direct image of F under f, while f, is a functor 
f.: Sh(X) — Sh(Y). In the other direction, each bundle E — Y, pulled 
back along f, yields a bundle f*E — X over X, and f* is a functor 
f*: Bund Y — Bund X. Moreover, 


Lemma 1. If f: X — Y is continuous and p: E — Y is étale over 
Y, then ft E — X is étale over X. 
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Proof: In the pullback f* E, consider any point (2z,¢) consisting of 
a pair of points c € X and e € E with fz = pe. Since p is étale, there 
is an open neighborhood U of e in E mapped homeomorphically by p 
onto an open set pU in X. Then f~'(pU) x U is an open neighborhood 
of (z,e) in the product X x E, so its intersection with the pullback is 
an open neighborhood of (z,e) there and is mapped homeomorphically 
onto f~1(pU) in X. Thus f*E is étale, as desired. 


By this lemma, each continuous f: X — Y gives a functor Sh(Y) > 
Sh(X) via the equivalence of Corollary 6.3, namely, as the composition 


sh(y) 4+ Etaley 4+ Etale X —"_. Sh(X). (1) 
We denote this composition again by 
ft: Sh(Y) — Sh(X). 


For a sheaf G on Y, the value f*(G) € Sh(X) of this functor is called 
the inverse image of G (under f). It is determined, in terms of its étale 
bundle, by the pullback square in the diagram 


A(f*G) ———> AG 


pe 


X —,—Y. 


f 


For any open set U C X, a section t € (f*G)(U) of the inverse image 
sheaf can thus be described, by the definition of a pullback, as a contin- 
uous map t’: U + AG such that pt’! = fly: U + Y. In other words, a 
section ¢ corresponds to a lifting t’ to AG of the map fly: U —» Y, in 
the sense that the diagram 


U—— Y 


flu 


commutes. In particular, for each open set V C Y and each s € G(V), 
we obtain a section 5: V — AG as in (5) of §5, and hence by composition 
a map t’ as above and therefore, by the pullback (2), a section 


ta: f-'(V) > A(f*@). (3) 
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More precisely, by the pullback diagram (2), a point of A(f*G) is of the 
form (z, germs ,)(s)), where x € X and germy,)(s) € (AG) f(z) is the 
germ of some s € G(V) for some open neighborhood V C Y of f(z). 
Writing points of A(f*G) as such pairs, we can write the section t, of 
(3) explicitly as 


ta(x) = (a, germy(ey(s)),  (w € f-*(V)). (4) 


Notice that since Af*G — X is étale, the image of each such section 
tg: f-1(V) — A(f*G) is open. Moreover, these images, for all open V C 
Y and all s € G(V), cover Af*G, as is clear from (4) since every point 
of Af*G has the form (x, germ, ,)(s)) for some x and s. Consequently, 
for any topological space T whatsoever, a function k: Af*G — T is 
continuous iff k ot,: f~!(V) — T is continuous, for every V and s as 
above; in other words, continuous = continuous on each section. We 
shall use this observation in the proof of the following theorem. This 
theorem is used to replace continuous maps between spaces by adjoint 
pairs of functors between their sheaf categories. 


Theorem 2. If f: X — Y is a continuous map, then the functor 
f*, sending each sheaf G on Y to its inverse image on X, is left adjoint 
to the direct image functor f,: 

f* 7 
Sh(X) = Sh(Y), f° 4 fe. 

Proof: Let F be a sheaf on X and G a sheaf on Y. We shall 
prove the desired isomorphism Sh(X)(f*G, F) = Sh(Y)(G, f.F’) in the 
following intermediate steps, reminiscent of the three descriptions in 
Corollary 6.5 of maps of sheaves: 


Sh(X)(f*G, F) = Etx(Af*G, AF) 

= K(Af*G, AF) 

~ Sh(Y)(G, f.TAF) = Sh(Y)(G, f.F). 
Here, Et (Af*G, AF) is the set of maps of étale bundles over X, as in §4, 
and K is still to be defined. It will be clear from the construction that all 
these isomorphisms are natural in F and G. Now the first isomorphism 
in this sequence comes from the equivalence between sheaves and étale 
bundles, as in Corollary 6.3, and the last isomorphism comes about by 
composition with the inverse of f.(7): fx —> f.(T AF); indeed, the unit 
n: F — TAF is an isomorphism because F is a sheaf, see Theorem 6.2. 
For the other two isomorphisms we first define K(Af*G, AF) as the 
functions continuous on sections; more formally, K is the set of functions 
k: Af*G — AF of sets over X with the property that, for any open V C 
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Y and any s € G(V), the composition kot,: f~'(V) + Af*G > AF 
is continuous. Now Etx(Af*G,AF) = K(Af*G,AF), by the remark 
immediately preceding the statement of the theorem. 
It remains to show that there is a natural isomorphism 
K(Af*G, AF) © Sh(Y)(G, f.(TAF)). (5) 
From left to right, given k: Af*G — AF in K(Af*G,AF), define 
Tt: G — f,(TAF) as follows: for an open set V, the component 
(tk)v: G(V) > f.(TAF)(V) = (TAF)(f7~!V) sends any element s € 
G(V) to the composite 
(th) v (8) = kot,: f-1(V) —24 AG —4> AF. 
Notice that (7,)v(s) is indeed continuous, by the definition of K. 
Conversely, from right to left in (5), given a natural transformation 
7: G-—» f,(T AF), define a function k,: Af*G — AF as follows. As ex- 
plained below (3), a point of Af*G has the general form (x, germ (,)(5)), 
where s € G(V) for some open set V C Y containing f(x). We define 
k(x, germ f(z)(8)) = Ty (s)(2). (6) 
It is easily seen that this is well-defined, i.e., it does not depend on the 
element s € G(V) chosen to represent the germ, germ y(,)(s) € (AG) f(z). 
Also, if V is open in Y and s € G(V), then by (4) above, 
(k, ots)(x) = k-(x, germ s,,)(s)) 
(7) 
= tv(s)(2); 
so k,ot, = Ty(s) is a continuous map f—?(V) —» AF. Therefore k, thus 
defined is indeed in the set K(Af*G, AF). It remains to be shown that 
these operations 
kh Tr 
K(Af*G, AF) <= Sh(Y)(G, f.(TAF)) ele 
are mutually inverse. But given a natural transformation 7: G —> 
f.(TAF), we have for any open V CY and any s € G(V), 


T(k,)(8) =k, ot, (by definition) 


= Ty(s) (by (7)); 
and given a function k: Af*G -» AF in K, we have for any point 
(z, germ ¢(z)(s)) € Af*G that 


ky, (2; erm 5 (2) (s)) = TK(S)(x) (by definition) 
= (kot,)(x) (by definition) 
= k(t,(x)) 


Thus k +> 7, and 7 +> k, are indeed mutually inverse, so the proof of 
the theorem is complete. 
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Proposition 3. Let f: X — Y bea continuous map of spaces. The 
inverse image functor f*: Sh(Y) — Sh(X) preserves all finite limits. 


Proof: By the definition (1) of f* as a pullback via the equiv- 
alence of Corollary 6.3, it suffices to show that the pullback functor 
f*: EtaleY — Etale X preserves finite limits. But this pullback func- 
tor f* is the restriction of the pullback functor on bundles, as indicated 
in the commutative diagram 


BundY —*— Bund X 


EtaleY 7s ge Etale X. 


And f*: Bund Y —» Bund X preserves (finite) limits since it has a left 
adjoint X, = “compose with f” [cf. Theorem 1.9.4]. Thus, it remains 
to prove that Etale X C Bund X is closed under finite limits, i.e., that 
the inclusion functor i preserves finite limits. If FE — X and E’ > X 
are both étale over X, their pullback E x x E’ in Top is their product 
in Bund X; moreover, by Lemma 1 this pullback F xx E’ > X is étale 
in X, and hence it is also the product of E and EF’ in Etale X. If also 
f, 9g: E —- E' are maps of bundles, an easy topological argument shows 
that their equalizer (in Top and thus in Bund X) is also étale in X, so 
is their equalizer in Etale X. Thus, the inclusion functor i on the right 
above preserves binary products, equalizers, and terminal objects, hence 
Preserves all finite limits (is left exact), as was to be shown. 


This section, in summary, has proved that each direct image functor 
fx: Sh(X) — Sh(Y) has a left exact left adjoint. Here, f, is constructed 
“functorially”, but its left adjoint has been constructed topologically 
(via étale spaces). This left adjoint can also be constructed by Kan 
extensions; see Chapter VII §5.(6). 


Exercises 


1. Show that a sieve S on U in the category O(X) is principal iff 
the corresponding subfunctor S C ly = Hom(—,U) is a sheaf. 

2. A sieve S on U in O(X) may be regarded as a full subcategory 
of O(X). Prove that a presheaf P on X is a sheaf iff for every 
covering sieve S on an open set U of X one has PU = lim, _. PY. 


3. An action G x X — X of a group G on a space X is said to be 
proper if for every point x € X there exists a neighborhood Uz 
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of x with the property that g-U,MU, 4 @ implies g = 1 for all 
g € G. Prove that if G acts properly on X, the quotient map 
a: X — X/G to the space X/G of orbits is a covering. 


. Prove Theorem 3 of §1. [Hint: define a quasi-inverse s: Sh(B) > 


Sh(X) for r as follows. Given a sheaf F on B, and an open set 
U ¢ X, consider the cover { B; | i € I} of U by all basic open 
sets B, € B which are contained in U. Define s(F)(U) by the 
equalizer 


s(F)(U) —— || F(Bi) = [[ (6 B;),] 


ier tg 


. A sheaf F on X is locally constant if each point x € X has a basis 


of open neighborhoods NV, such that whenever U, V € N, with 
U CV, the restriction FV — FU is a bijection. Prove that F is 
locally constant iff the associated étale space over X is a covering. 


. Prove that both the functors [ and A of §5 and hence also the 


associated sheaf functor are left exact (i.e., preserve all finite lim- 
its). 


. For any set 7’, the constant presheaf T on a space X has T(U) = T 


for all open sets U in X, with all restriction maps the identity. 
Show, using germs, that the associated étale space is the projec- 
tion p: X xT — X of the product, where T has the discrete topol- 
ogy; conclude that the associated sheaf is the “constant” sheaf 
Ar, for which Ar(V) is the set of all locally constant functions 
V -+T. Prove also that this defines a functor A: Sets —> Sh(X) 
which is left adjoint to the global sections functor Sh(X) —> Sets, 
FRITF(X). 


. (Sheaves are “collatable’ up to isomorphism, cf. [Serre, 1955, 


Proposition 4].) Let X have an open cover by sets Wy, and 
suppose for each index k that Fy is a sheaf on W,, for each j 
and k that 05%: Fy |(w,nwe) = Fy|(w, nw.) is an isomorphism of 
sheaves, and for each i, j, k that 0;, = 0;,06,, wherever defined. 
Then prove (by reducing the situation to that in Theorem 1.2) 
that there is a sheaf F on X and isomorphisms ¢,: Flw, — Fr 
such that ¢, = 0,,¢, wherever defined. Prove also that F and the 
oe are unique up to isomorphism. 


. Let f: X + Y be an étale map. Show that f*: Sh(Y) — Sh(X) 


has a left adjoint. Give an example of a map f: X — Y such 
that f*: Sh(Y) — Sh(X) cannot possibly have a left adjoint. 

(a) Prove that a map p: Y - X of topological spaces is étale 

iff both p and the diagonal Y + Y xx Y are open maps. 
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(b) Prove that in a commutative diagram of continuous maps 


where p and q are étale, f must also be étale. 


11. Show that the category of sheaves of abelian groups on a space 
X does not have a subobject classifier. 


III 


Grothendieck Topologies 
and Sheaves 


1. Generalized Neighborhoods 


The notion of a sheaf, presented as in Chapter II in terms of cov- 
erings, restrictions, and collation, can be defined and used (e.g., for 
cohomology) not just on the usual topological spaces but also on more 
general “topologies”. This section is meant to provide some informal 
background and motivation for the development of these general topo- 
logical ideas. 

In the usual definition of a topological space and of a sheaf on that 
space, as discussed in the previous chapter, one uses the open neighbor- 
hoods U of a point in a space X; such neighborhoods are topological 
maps U — X which are monic. For algebraic geometry it turned out 
that it was important to replace monomorphisms by more general maps 
Y —»+ X; these will appear in the definition below of a Grothendieck topol- 
ogy. There were at least two motivations for this replacement, arising 
in the context of fibrations, and in that of Galois theory. 

For algebraic groups, Serre observed that some mappings p: W > X 
which ought to be fiber bundles (in the topological sense) were not such. 
The topological definition required that each point x of the base X have 
a neighborhood U over which the bundle becomes trivial; that is, such 
that the pullback bundle W xx U over U is trivial. This did not apply in 
Serre’s examples, but he discovered that by taking a suitable unramified 
covering U’ — U, the pullback W xx U’ did turn out to be trivial. This 
suggested to Grothendieck the idea of replacing inclusions U — X by 
more general maps U’ — X, in defining the “open covers” of X. 

A similar generalization was suggested by Grothendieck’s discovery 
(around 1961, in [SGA 1)) of the analogy between the Galois groups of 
a field and the fundamental group of a topological space, as follows. 

On the one hand, consider a finite normal extension N > K of 
a field K (of characteristic 0, to avoid complications with inseparable 
extensions). This extension is a monomorphism m: K — WN in the 
category of fields (where every morphism is, in fact, mono), and the 
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Galois group G of N over K consists of the field automorphisms ¢: N + 
N with om = m; i.e., of those o which leave fixed all elements of the 
base field AK. The fundamental theorem of Galois theory states that 
subgroups S C G of the Galois group correspond in a one-to-one fashion 
to intermediate fields L with K C L C N, that is, to factorizations 
Kk — L ~— N of the given mono m; given a subgroup H C G, the 
corresponding intermediate field consists precisely of the elements of N 
fixed under all automorphisms in the given subgroup. 

On the other hand, consider topological spaces X, Y which are arc- 
wise connected and locally arcwise connected. A covering space Y of X is 
then an epimorphism p: Y — X in the category of these spaces such that 
each point x of X has a neighborhood U for which each arc-component 
of p-1U is mapped by p homeomorphically to U (see Chapter II, §4); 
more formally, the pullback Y xx U is a coproduct 


YxxU=UUUUU.::- 


of copies of U. The covering group G of p then consists of the covering 
transformations: those homeomorphisms 0: Y — Y such that po = 
p: Y — X. Now suppose that Y is simply connected or, more generally, 
that o: Y — X is a regular covering. [For y € Y, the image under o 
of the fundamental group 71(Y, y) is a normal subgroup of 7(X, cy).| 
Then, exactly as in the Galois theory, the subgroups S of G correspond 
one-to-one to the factorizations Y — Y' — X of p, where Y’ is defined as 
the quotient of Y obtained by identifying two points y, and yo precisely 
when oy; = y2 for some o € S. 

This analogy clearly involves a dualization: spaces over X are epi- 
morphisms to X, while extensions of a field K are monomorphisms from 
kK. In the category of spaces Y — X, the product is given by the pull- 
back Y xx Y’ — X, while the disjoint union provides the coproduct 
YIUY’' — X. The category of fields does not have duals of both these 
operations, but they are present if we embed fields in the larger cate- 
gory of commutative rings. (Henceforth, all rings are commutative, with 
identity element, and all ring homomorphisms preserve the identity.) For 
two rings R and S, the tensor product R® S with the injections 


R>REOSLS, rer@l, s1@s, 


is the coproduct, while for any indexed family of rings R, the cartesian 
product, [] R; with termwise ring operations, is the categorical product. 
Similarly, one may consider algebras A over a field K, defined to be rings 
A equipped with a ring homomorphism K — A. The category of such 
-algebras has coproducts A®x B and products A x B. 

But now the parallel between geometry and algebra seems to fail. 
The definition of a covering space Y — X states that for every point 
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x € X, there is a neighborhood U (an open monomorphism U — X) 
such that the pullback Y xx U — U is a coproduct of copies of U. 
Similarly, any field extension K — L has a splitting field K — N with 
N normal. The fact that N splits L means that the tensor product (the 
coproduct) L@x N is a direct product of copies of N; in elementary 
terms, if D is given, say, as the field L = K(0) generated by a root @ of 
an irreducible polynomial f(x) of degree m in K[z], a normal extension 
N which splits L will contain all the roots 0),...,0m of the polynomial, 
so that L@x N & N[z)/(z — 01)---(w — Am) is just the product of m 
copies of N. 

The difference here is that a covering space is “split” over a neighbor- 
hood U, which is a monomorphism U — X, while a field is split not by 
the dual (an epimorphism), but by a more general map K — N. It was 
this observation that led Grothendieck to think that the neighborhoods 
U in topological spaces could effectively be replaced by maps C — X 
which are not necessarily monic. Then a covering by open sets would be 
replaced by a new-style “covering” by a family of such maps. 

This can be done in any category C with pullbacks. For an object 
C of C, consider indexed families 

S={fi:C; ~-Cliel} 
of maps to C,, and suppose that for each object C’ of C we have a set 
IG AGS 85) 5200} 
of certain such families, called the coverings of C under the rule K. Then 
for these coverings we can repeat the usual topological definition of a 
sheaf. As presheaves on C, we simply take the functors P: C°P — Sets. 
The classical definition of a sheaf on a topological space then required 
that for each open cover { U, | i € I} of some U, every family of elements 
{x, € P(U;) |i € I} which matched on the intersections U; MU, for all 
i and j could be fitted together as a unique element z € P(U). We can 
repeat this definition for any covering S of an object C, replacing the 
intersection U;U; by the pullback C; x¢ C;, as in the diagram 


which yields for a given presheaf P a corresponding diagram of sets 


P(C, Xe Cj Pe) 


Pir, ] Jr 


2. Grothendieck Topologies 109 


It will be convenient to write xf; € P(C,) for the result P(f;)(x) of the 
action of f; on an element x of P(C), as in Chapter I, §4. Then the 
matching condition for a sheaf reads: if { 2, € P(C;) |i € I} is a family 
of elements which match, in the sense that r;v;, = x,h;, for alli and 7, 
then the family determines a unique element « € P(C) such that 


tf, = 2; for all i € I. 


This amounts to the familiar requirement that the following arrow e is 
an equalizer in the diagram 


P(Cc) +> ][[ Pc) = [][ PC xe G), 


i,j 


where e(x) = (xf;: i € I), and the maps on the right are respectively 
(aj: 1 CD) (ayy: i,7 € Ex 1) and (23:7 € I) (a,h,;: 1,7 € Ix I). 

Now to be able to develop a theory of sheaves in such generality, one 
needs some conditions on the rule K which assigns covering families to 
objects of C. It is, in fact, possible to develop the theory, even in the 
extreme case when C does not have pullbacks, by replacing the indexed 
families S' by the sieves which they generate. This will be done in the 
next section. 
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As motivated in the preceding section, we will now introduce a gen- 
eral notion of a category equipped with covering families. Let C be a 
small category, and let Sets©”” be the corresponding functor category. 
As in Chapter I we write 


y:C-3 Sets©” 


for the Yoneda embedding: y(C) = C(—,C). Recall from Chapter I, 
§4, that a sieve S on C is simply a subobject S C y(C) in Sets©”. 
Alternatively, S may be given as a family of morphisms in C, all with 
codomain C,, such that 


fES = fogeS 


whenever this composition makes sense; in other words, S is a right 
ideal. If S is a sieve on C and h: D — C is any arrow to C, then 


A*(S)={g|cod(g)=D, hgeS} (1) 


is a sieve on D. 
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Definition 1. A (Grothendieck) topology on a category C is a func- 
tion J which assigns to each object C' of C a collection J(C) of sieves 
on C’, in such a way that 


(i) the maximal sieve tc = { f | cod(f) = C} is im J(C); 
(ii) (stability axiom) if S € J(C), then h*(S) € J(D) for any arrow 
h: DOC; 
(iii) (transitivity axiom) if S € J(C) and R is any sieve on C’ such 
that h*(R) € J(D) for allh: D > C in S, then RE J(C). 


It follows at once that for S € J(C) any larger sieve R on C (ie., 
any R D S) is also a member of J(C). Indeed, take any h: D > C 
with h € S. Then 1p € h*(S), so A*(S) is the maximal sieve on D. 
Since h*(S) C A*(R), h*(R) must also be the maximal sieve on D, so 
h*(R) € J(D). This holds for all h € S, so it follows by the transitivity 
axiom that R € J(C). 

With this observation, the transitivity axiom has the following con- 
sequence: 


(iii’) If S € J(C) and if for each f: Dy — C in S there is a sieve 
Ry € J(Dy), then the set of all composites fog, with f € S and 
g € Ry, isin J(C). 


A site will mean a pair (C, J) consisting of a small category C and 
a Grothendieck topology J on C. If S € J(C), one says that S is a 
covering sieve, or that S covers C' (or, if necessary, that S J-covers C’). 

We will also say that a sieve S on C covers an arrow f: D3 C 
if f*(S) covers D. (So S covers C iff S covers the identity arrow on 
C.) In this language, the axioms for a Grothendieck topology can be 
formulated as follows (arrow form): 


(ia) if S is a sieve on C’ and f € S, then S covers f; 
(iia) (stability) if S covers an arrow f: D — C, it also covers the 
composition f o g, for any arrow g: E > D; 
(iiia) (transitivity) if S covers an arrow f: D — C, and R is a sieve on 
C which covers all arrows of S, then R covers f. 


The original conditions (i)-(iii) for a Grothendieck topology eas- 
ily follow from these arrow conditions (ia)—(iiia), by considering the 
identity arrow on C. Conversely, one readily derives the latter con- 
ditions from (i)—(iii), For example, consider (iiia): Suppose S covers 
an arrow f: D — C, and R covers all arrows in S. By definition, 
this means that f*(S) € J(D) and h*(R) € J(domA) for all A € S. 
Thus, for an arrow g: E — D in f*(S), we have fg € S and hence 
g* f*(R) = (fg)*(R) € J(E). Since this holds for all such g, the transi- 
tivity axiom (iii) implies that f*(R) € J(D); ie., R covers f. 
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Note that it follows from the axioms that any two covers have a 
common refinement; in fact one has 


(iv) if R, S € J(C), then RNS € J(C); 
or in arrow form: 
(iva) if R and S both cover g: DC, then RNS covers g. 


Indeed, if f: D — Cis any element of R, then f*(RNS) = f*(S) € J(D) 
by (ii), and therefore RMS € J(C) by (iii). This proves (iv); (iva) is 
equally easy. 

A topological space with the usual notion of a cover provides an 
example of a site: The partially ordered set O(X) of open subsets of X 
can be viewed as a category in the usual way (with exactly one arrow 
U — V iff U C V), so that a sieve on U is simply a family S of open 
subsets of U with the property that V’ C V € S implies V’ € S. Then 
specify that S covers U iff U is contained in the union of the open sets 
in S. It is easy to check that this “open cover” definition satisfies the 
axioms for a Grothendieck topology (cf. Exercise 1). In this case an 
arrow f to an object C (i.e., to U) is just an open subset W C U, and 
the sieve S on U covers the arrow W C U iff W is contained in the 
union of the open sets in S. This motivates the “arrow form” above of 
the axioms for a Grothendieck topology. 

In the case of an ordinary topological space, one usually describes 
an open cover of U as just a family {U, | i € I} of open subsets of U 
with union (JU; = U; such a family is not necessarily a sieve, but it 
does generate a sieve—namely, the collection of all those open V C U 
with V C U; for some U;. (Informally, V goes through the sieve if it fits 
through one of the holes U; of the sieve.) It will be convenient to define 
this way of generating a covering sieve in the more general context of 
an arbitrary category with pullbacks, in terms of a so-called basis for 
a Grothendieck topology. (For a similar definition of a basis in case C 
does not have pullbacks, see Exercise 3.) 


Definition 2. A basis (for a Grothendieck topology) on a category 
C with pullbacks is a function K which assigns to each object C a 
collection K(C) consisting of families of morphisms with codomain C, 
such that 


(i') if f: C’ 4 C is an isomorphism, then { f: C’ = C} € K(C); 
(ii) if { fi: C; 2 C |i € I} € K(C), then for any morphism g: D > 
C, the family of pullbacks {m2: C; xc D > D |i € I} is in 
K(D); 
(iii’) if { f;: C, 2 C |i eI} € K(C), and if for each i € I one has 
a family {9:,: Diy > Ci | J € L:} € K(C;), then the family of 
composites { f° gij: Diy > C |i € 1,7 € I;} isin K(C). 
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Condition (ii’) is again called the stability axiom, and (iii’) the tran- 
sitivity axiom. The pair (C, K) is also called a site and the elements R 
of the set AK(C) are called covering families or covers for this site. 


Warning. In the sequel, we will often loosely refer to a topology on 
a category, even where it is clear from the context that we really mean 
a basis for a topology. 


Let us note that, because of (i) above, a topology J is not a basis 
[although (ii’) and (iii’) are satisfied by J]. However, if K is a basis on 
C, then K generates a topology J by 


SeJ(C) = ARE K(C) RCS; (2) 


ie, a sieve is a J-cover iff it contains a K-cover. It is easy to check 
that this indeed defines a Grothendieck topology J from a basis K. For 
instance, let us verify that the stability axiom holds: if S € J(C) and 
g: D > Cis any morphism, choose R C S with R € K(C) as in (2), and 
let T € K(D) be the K-cover of D obtained as in (ii’) above, by pulling 
back R along g; that is, T consists of all those morphisms fh which fit 
into a pullback diagram 


DxcC'—>C 


| [ 


D—_;—C 


for some f € R. Then T € g*(S), so g*(S) € J(D) by (2). 
Notice also that if J is a given topology on C, there is a maximal 
basis K which generates J, given by 


ReEK(C) = (Re JC), (3) 


where 
(R)={fog|feER, dom f =codg} (4) 


is the steve generated by the family R. [Given a Grothendieck topology 
J, one often loosely says that a family R = { f;: C; > C |i € I} covers 
C if (R) € J(C),] 

Condition (iv) above, on intersections of covers, does not hold in as 
simple a form for bases. Say that a family { f;: C; > C |i € I} refines 
another one {g;: D; > C'| j € I'} if every f, factors through some gj. 
Then the analog of (iv) for a basis is: 


(iv’) for any two covers R, P € K(C), there exists a common refine- 
ment in K(C). 
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The assertion (iv’) follows from (iv) and the correspondence between 
bases and topologies: if J is the Grothendieck topology on C generated 
by K, then (R), (P) € J(C), hence by (iv), (R) N(P) € J(C). By (2) 
this means that there is a T € K(C) with T C (R)N(P). But T C(R) 
means that T refines R. Also T refines P, so (iv’) is proved. 

Let us look at some additional elementary examples. 

(a) If C is any category, the trivial topology on C is the one in which 
the only sieve covering an object C’ is the maximal sieve tc. Clearly, 
this topology is the smallest (coarsest) among all topologies on C. 

(b) Let T be a small category of topological spaces, which is closed 
under finite limits and under taking open subspaces. (For example, T 
might be a category of separable Hausdorff spaces.) Define a basis K 
on T by { fj: ¥; ~ X |i €¢ I} © K(X) iff each Y; is an open subspace 
of X with f, the corresponding embedding, while L);., Y; = X. We will 
refer to this basis, or to the corresponding Grothendieck topology, as 
the open cover topology on the category T of spaces. 

It should be observed that to define this open cover topology, we do 
not really need all pullbacks in T to exist; for the stability axiom (ii’) 
above we really need only pullback diagrams of the form 


ie 2) | 


| | 


¥—>—>X 
where U C X is an open subspace. Thus, for instance, on the category 
M of (separable) smooth manifolds M, one can still define the open 
cover topology J, by taking S € J(M) iff there is an open cover {U,} 
of M such that all the embeddings U; — M are in the sieve S. (Never- 
theless, arbitrary pullbacks do not exist in the category M, for example 
intersections of certain submanifolds.) 

The purpose of these open cover topologies is to put all the sites 
O(X) for topological spaces X together in one big site, so as to provide 
@ convenient context in which to consider sheaves which are defined not 
just on one given space X, but on each of the spaces from a large cate- 
gory of spaces. A typical example is the sheaf of real-valued continuous 
functions discussed in §II.1; see also §VI.9. 

(c) A bigger (that is, a finer) Grothendieck topology on the cate- 
gory T above is the one generated by the basis K’, defined by taking 
{ fi: ¥, 2 X |i eI} € K(X) iff f: LLY; — X is an open surjection. 
Here [[Y, is the disjoint sum (coproduct) of the spaces Y,, and f is the 
map induced on the coproduct by the different f;. For the proof that 
this K’ is a basis, use the fact that in a pullback diagram of topological 
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spaces 
Z xXx Y ——— Y 


| | 
Z yt X, 
q is an open surjection whenever p is. 

(d) Complete Heyting algebras. Recall from 81.8 that a Heyting 
algebra is a distributive lattice A equipped with an implication operator 
=>, satisfying 

a<(b=>c) if aAb<ec (5) 
for all a, b, c € A. A complete Heyting algebra (cHa) is a Heyting 
algebra which is complete as a lattice; i.e., sups and infs of arbitrary 


families of elements exist. If A is a cHa, and {a; | i € I} and 5 are 
elements of A, then one has the identity (the infinite distributive law) 


VV (bAaj) = bA\) ai. (6) 


iel ier 


This follows from the property (5) of implication, for if c € A is any 
other element one has 


VbAa<e iff bAa; <c for all i 
ied 
iff a; < (b> c) for all i 
iff \ a; < (b> 0) 
iel 
iff DA VV ai <c. 
ieI 
Conversely, it is not difficult to prove (Exercise 17) that if A is a lattice 


in which arbitrary sups exist and satisfy the identity (6), then A has the 
structure of a cHa in which = is defined by 


boc = V{alanb<c}. (7) 


Now let A be a cHa, and regard A as a category in the usual way (so 
that there is exactly one arrow a — b iff a < b). Then Acan be equipped 
with a basis for a Grothendieck topology K, given by 


{a,|ieI}EK(c) iff Va=e. (8) 
ier 
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(On the left-hand side, we identify an element a; < c with the corre- 
sponding morphism a; — c.) The identity (6) is precisely the stability 
condition (ii’) of Definition 2 above. 

A sieve S on an element c of A is just a subset S of elements b < c 
such that a < 6 € S implies a € S. In the topology J with basis 
K as in (8) above, a sieve S on c covers c iff c = VS, ie, c is the 
sup of the elements in S. Hence, this topology is often called the sup 
topology. In the special case where A is the algebra of all open subsets 
of a topological space X, this is exactly the usual open cover topology. 
In this sense sheaves on a cHa (to be discussed in the next section, and 
in Chapter IX) generalize the usual sheaves on a topological space as 
discussed in Chapter II. 

(e) The dense topology. If P is a partially ordered set and if 
pé€ P,asubset DC {q € P|q <p} is said to be dense below p if for 
any r < p there is a g < r with g € D; in other words, for the portion 
of P below p, the dense set D gets below any element whatever. (This 
notion of dense subsets is used by logicians in relation to forcing; the 
corresponding sheaves yield models of set theory which violate axioms 
such as the continuum hypothesis, cf. Chapter VI below.) The dense 
sieves form a topology J on P by 


J(p) ={D|q<p for all q € D, and D is a sieve dense below p }. 


[Actually covering families D € J(p) should consist of arrows q — p, 
but we again identify these with elements q such that g < p.] It is easy 
to see that J satisfies the conditions of Definition 1. 

The dense topology can in fact be defined for an arbitrary category 
C: for a sieve S, let 


S € J(C) iff for any f: D> C there isa g: E> D (9) 
such that fg € S. 


Then J becomes a topology on C. For reasons which will become clear 
in Chapter VI, one also refers to the dense topology as the -—-topology. 
(f) The atomic topology. Let C be a category, and define J by 


S € J(C) iff the sieve S' is nonempty. 


For J to satisfy the stability axiom in this case we must assume that 
any two morphisms f: D — C and g: E — C with a common codomain 
C can be completed to a commutative square 


f an 


by eo-- 
Q 
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—a condition much weaker than the existence of pullbacks. Indeed, J 
as defined above is a topology iff C satisfies this condition (10). 

A particularly interesting case which we will consider in this chap- 
ter is that where C is the opposite category of the category I of finite 
sets and injective functions; see §9 below. Notice also that the atomic 
topology is a special case of the dense topology. 

The final example of a topology in this list will be described in the 
next section. 


3. The Zariski Site 


Algebraic geometry is naturally concerned with functions which are 
rational or algebraic. We will now explain how they can best be handled 
by using sheaves on a suitably constructed site. The Zariski site will be 
defined on a category of algebras over an arbitrary commutative ring k, 
but we will begin with the classical case where k is the field C of complex 
numbers; then C” is the usual n-dimensional complex affine space. 

Given a polynomial f(21, x2) in two variables x1, and x2 and with 
complex coefficients, its locus is the algebraic curve which consists of 
all those points (21, z2) in the complex plane C? with f(z, z2) = 0. 
Similarly, an algebraic curve in three dimensions may be described as the 
locus of a suitable pair of polynomial equations in three variables. More 
generally, m polynomials f,,..., fm in the polynomial ring C[z1,..., Zn] 
have as a locus the set 


V(fi,..-,fm) = 
{Gc PICO Pa IA PEA ae) 


Such a locus is called a complex affine variety. Any such variety V 


determines an ideal in C[z,,..., 2p], namely, the ideal of all those poly- 
nomials which vanish at every point of V. Conversely, given an ideal I 
in the polynomial ring C[z,,..., 2p], the corresponding variety is 


V(1) = {(21,---, Zn) € C” | f(zi,..-,2n) =O forall fe}. (2) 


Evidently, the ideal I = (f,,..., fm) generated in C[z,,...,¢n] by the 


polynomials fi,..., fm has V(I) = V(fi,.-., fm): 
Different ideals may well generate the same variety. Thus, every 
ideal I determines another ideal 


VI ={f | for some positive integer r, f” € I} (3) 


called the radical of I. Clearly V(I) = V(VI). The famous Hilbert Null- 
stellensatz states that different radical ideals in C[z,...,2,] generate 
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different affine varieties; in other words, /I 4 VJ implies V(I) 4 V(J). 
The maximal ideals in C[x,,...,2,] are the ideals (1) — a1,..., 2p —@n) 
determined by n complex numbers aj,...,a,; the corresponding variety 
is the point with coordinates @1,...,@,; it is clearly a minimal algebraic 
variety. Any prime ideal P (an ideal P such that fg € P implies f € P 
or g € P) is a radical ideal; that is, P = VP. The corresponding affine 
variety V(P) is irreducible, in the sense that it is not the union of a finite 
number of smaller affine varieties. It can be shown that every radical 
ideal (i-e., every ideal which is equal to its own radical) can be repre- 
sented uniquely as the intersection of a finite number of prime ideals. 
This corresponds to the statement that any complex affine variety can 
be written uniquely as a finite union of irreducible such varieties. 

The relevant functions on an algebraic variety are those given by 
rational functions of the coordinates. On the ordinary cartesian plane 
with coordinates x, y such a rational function f(x, y)/g(z, y) is not de- 
fined in the whole plane, but only where the denominator g(x, y) is not 
zero, hence only in the complement of the algebraic curve defined by 
g(x,y) = 0. In higher dimensions, such functions are correspondingly 
defined on the complement of some variety V(I). Hence, it is natural to 
consider such complements as if they were open sets in a topology. They 
are, of course, open sets in the standard topology of C”. But one may 
also introduce a different topology on the set C” by decreeing that the 
closed sets are exactly the varieties V(I) for all ideals I; the usual axioms 
for a topology then hold. One may also describe this topology by say- 
ing that the complements of irreducible varieties (determined by prime 
ideals P) form a subbasis for the open sets. Thus there are many fewer 
open sets than in the usual Euclidean topology on C”. One calls this 
new smaller topology the Zariski topology on C”. The locus of a single 
polynomial f(2,,...,2n) = 0 is called an algebraic hypersurface (ie., a 
variety of dimension n — 1). Clearly the complements of hypersurfaces 
form a subbasis for the Zariski topology. In terms of this subbasis, one 
can describe a simple open covering of C”: take ¢ polynomials f,,..., ft 
such that the identity f; +---+ f; = 1 holds. The corresponding t 
hypersurfaces then have no common point, so the complements of these 
hypersurfaces form an open cover of C” in the Zariski topology. The 
Zariski topology on C” also restricts to give a Zariski topology on any 
affine variety V in C”. 

Associated with the Zariski topology there is a standard structure 
sheaf. To define it consider the field F = C(2,...,2,) of all formal 
rational functions h = f/g, where f and g are polynomials in the inde- 
terminates 71,...,2n with complex coefficients and g is not identically 
zero, Such a formal function f/g is not actually defined at every point of 
C” but these formal rational functions will yield a sheaf for the Zariski 
topology, as follows. One says that a rational function h is defined at 
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a point Q = (a1,...,@,) of C” if there is a Zariski open set W con- 
taining the point Q, such that h can be written as a quotient h = f/g 
in which the polynomial g does not vanish at any point of W; this of 
course means that the formal rational function A yields an actual func- 
tion W — C defined everywhere in the Zariski open set W. Now let U 
be any Zariski open set, and define O(U) to be the set of all rational 
functions A which are defined (in the above sense) at every point Q of 
U. [For convenience, when U is empty we take O(@) to be the set with 
one point, which we regard as if it were the unique rational function 
defined over the empty set.] The set O(U) is clearly a ring; indeed, for 
U # @, it is a subring of the field F. Moreover, an inclusion U’ C U 
of Zariski open sets evidently gives a homomorphism O(U) > O(U’) of 
rings [restrict each h € O(U) to points of U’|. Thus, O is a presheaf of 
rings for the Zariski topology of C”. One can prove that it is actually 
a sheaf, because rational functions can be patched together where they 
match. It is the desired structure sheaf. Moreover, the stalk of this 
sheaf at a point p of C” consists of the germs of those rational func- 
tions defined in some (Zariski) open neighborhood of p. This stalk is a 
ring with a unique maximal proper ideal—the ideal of all those germs 
which vanish at the point p. Such a ring, with a unique maximal ideal, 
is customarily called a “local ring”. Thus, algebraic geometry naturally 
leads to sheaves with such stalks. Thus, the Zariski topology on C”, or 
on an affine variety V C C” as described above, is an ordinary topology 
(rather than a Grothendieck topology). It gives rise to a Grothendieck 
topology on the partially ordered set of Zariski open subsets of V, with 
covers given by the usual covering families, as explained in the previous 
section. 


More generally, one may consider the category of all affine varieties 
V © C® (for various n > 0); a morphism ¢: V — W in this category 
between two such varieties V = V(I) C C" and W = W(I’) C C™ is the 
function given by an m-tuple ¢ = (A),...,Am) of rational functions of 
21,-..,£n with the property that each A, is defined at every point of V 
and such that (h),...,4m), viewed as a function V — C™, maps V into 
W. We can equip this category of complex affine varieties with the open 
cover topology, which is the Grothendieck topology given by covering 
families of Zariski open sets; it is a special case of the construction in §2, 
Example (b). We should add that this makes sense because the category 
of complex affine varieties is closed under taking limits and under taking 
Zariski open subspaces. It is easiest to see the latter in a special case. 
If V is an affine plane curve, say the locus of g(x,y) = 0, while U is a 
Zariski open set in V, of the form U = V—V(f), for another polynomial 
f, then U is isomorphic to the variety W in three-space which is the 
locus of g(x,y) = 0 and zf(z,y) = 1 (draw a figure). Much the same 
construction applies in general, as the reader may verify. 
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This elementary description is just the starting point for the use of 
sheaves in algebraic geometry. First, the ideas of algebraic geometry ap- 
ply to varieties defined over fields different from the complex numbers, 
and even to fields of characteristic p. Moreover, the study of diophan- 
tine equations with integral coefficients leads to “arithmetic” algebraic 
geometry over rings such as Z or rings of algebraic integers. These ideas 
combine to motivate the definition of the Zariski site for an arbitrary 
commutative ring k. 

For this construction, we will need to construct certain “fractions” 
from a commutative ring A (always with the identity element). For each 
a € A, one may form a ring A[a~"] of quotients, consisting of all frac- 
tions b/a” for b € A and n a natural number, where these fractions are 
equated, added, and multiplied by the usual rules. Then b+ b/1 is a 
ring homomorphism A — A[a~"] which is universal among homomor- 
phisms from A into rings in which a becomes invertible. One may also 
set A[a~1] = A[x]/(ax—1). (Note incidentally that if a is nilpotent, then 
the ring A[a"] is the zero ring.) We will also use the corresponding con- 
struction starting from a prime ideal P in A, giving a homomorphism 
A— Ap universal with the property that all a € A which are not in P 
become invertible in Ap. 

For an arbitrary commutative ring k one can still define algebraic 
varieties in the n-dimensional affine “space” k”. Given an ideal I in the 
polynomial ring k[x1,...,c,], the corresponding variety V(I) consists of 
all the points (a),...,@,) € k” with a; € k such that f(a@,,...,@,) =0 
for every f in I. However, there is no analog of the Hilbert Nullstel- 
lensatz, so there are not “enough” points to distinguish varieties which 
“ought” to be different. Therefore, one works not with the varieties 
themselves, but with the associated quotient rings k[z1,...,2n]/I. So 
one defines a finitely presented (“fp”) k-algebra A to be one of the form 
A= k[r,...,2m]/(fi,--+, fm) where the f, for j = 1,...,m are poly- 
nomials in the indeterminates x, with coefficients in the ring k. Let 
(k — Alg)tp denote the category of all these algebras. One can observe 
that a morphism of such f. p. algebras will induce a map in the op- 
Posite direction on the corresponding varieties. Hence, we will use the 
Opposite of this category. Observe also that a ring in this category may 
have nilpotent elements; such elements are used for deformations. They 
Provide a way of distinguishing, say, the “variety” defined by x — y = 0 
from that given by (x — y)? = 0! 

Now our objects are not sets of points but algebras, so we cannot 
use functions defined on points, but must use sheaves defined on this 
category. Instead of open covers, we will imitate the open covers of C” 
by the complements of hypersurfaces as described at (3) above. Thus a 
“cover” of a k-algebra A will be determined by a finite list of elements 
@,...,@n of A, to imitate the hypersurfaces given by setting a; = 0, and 
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such that the identity element 1 of A is contained in the ideal (a),..., an) 
generated by the a;. This is to reflect the intention that the “hypersur- 
faces” have no common point, so that these “complements” can cover 
C”. This means that we do not define a classical topology but rather a 
base for a (Grothendieck) topology on the category (k — Alg)p)°P. 

A cover of an algebra A for the intended basis is the dual, i.e., the 
opposite, of a finite family of the form (up to isomorphism) 


{A— Afap"] |¢=1,...,2}, (4) 


where each Ala; '] is a ring of quotients, as described above, where the 
maps A — Ala;"] are the canonical (universal) homomorphisms, and 
where 1 is in the ideal (a1,...,@,) of A. [Notice that if A is a finitely 
presented k-algebra, then so is A[a;"], since the latter is isomorphic to 
Alz|/(aa; — 1) 

This definition of a basis for a Grothendieck topology does satisfy the 
axioms for a basis given in §2, Definition 2. Clearly, the dual of the iden- 
tity map A — A is a one-element covering family. The stability condi- 
tion (ii’) is satisfied, as follows: Pullbacks in ((k—Alg)s,)°P exist, because 
they correspond to tensor products in (k—Alg)sp. Thus, if a1,..., @n are 
elements of A with 1 € (a1,...,@n), while h: A > B is an algebra homo- 
morphism, then 1 € (h(a1),...,A(@n)) and B@,4(Alaz"]) & BlA(a;)- 3]; 
Le., the diagram of algebras 


A———> Ala;*] 


| | 


B—_> Blh(a;)~"] 


is a pushout. 
To prove the transitivity condition (iii’) for a basis, suppose we have 
the dual of a cover (4), and for each index i another dual of a cover 


{ Ala] — Alay "I[e,,"] |F=1,--.,ni}, (5) 


where c,, is an element of Ala; "] and 1 € (¢j),...,Cin,), an ideal in 
Ala;"]. Now each c;; has the form b;,/a,"", for some integer m,, and 
some element b;; of the original algebra A. Therefore inverting c;, in 
Ala; *] amounts exactly to inverting };;, and we may write these dual 
covers (5) as 


{ Alay] — Alay] [b5"] — Al(aibig)“"] | 4,5}, (6) 
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where 1 € (bj,,...,bin,) as an ideal in A[a;"]. Multiplying away the 
denominators which are powers of a; gives a natural number k; such 
that 

a;™ € (ajbi1,..-, @:din,) 


as an ideal in A. Since 1 € (a1,...,@,) in A, say 1 = }° dja;, one can 
choose k > n- max(k;) so that 


l= ‘o> das)” €(ay span), 


and the latter ideal is contained in the ideal of A generated by all the 
a;b;;. Thus, the family of composites A > A[a;| > A[a;;;] is the dual 
of a cover, as required for transitivity. 

Much like the sheaf O discussed above, there is a canonical structure 
sheaf on the Zariski site (see §4), which is again a sheaf of local rings; 
this aspect of the Zariski site will be discussed in Chapter VIII, §6, on 
classifying topoi. 

This example of the Zariski site indicates why the notion of a 
Grothendieck topology became essential for algebraic geometry. The 
reader may wish to consult many other such examples, such as those 
arising from the use of schemes and of the étale topology in algebraic 
geometry. 


4. Sheaves on a Site 


Sheaves on a site can be defined in much the same way as sheaves 
on a topological space. 

Let C be a small category and J a Grothendieck topology on C. A 
presheaf, as before, is simply a functor 


P: C°P — Sets; 


the category of all such functors and their natural transformations has 
been discussed in Chapter I. If P is such a presheaf and the sieve S is 
& cover of an object C of C, a matching family for S of elements of P 
is a function which assigns to each element f: D — C of S an element 
Ff € P(D), in such a way that 


Le + G=LFfg for all g: FE = Din C. (1) 
Here, fg is again an element of S, because S is a sieve, while zy - g 
Stands for P(g)(xs)—just as in Chapter I. An amalgamation of such a 


matching family is a single element x € P(C) with 


a-f=vy for all f € S. (2) 
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Then P is a sheaf (for J) precisely then, when every matching family 
for any cover of any object of C has a unique amalgamation. 

A sieve S on C is the same thing as a subfunctor of yC = 
Hom( —,C); so a matching family f +> xy for f € S is the same thing 
as a natural transformation S — P. Thus, a presheaf P is a sheaf iff, for 
all covering sieves S' of objects C’, any natural transformation f: S — P 
has a unique extension to yC, as in the diagram 


gs—_f.p 


he 
yC. 


In other words, P is a sheaf iff for every covering sieve S on C' the 
inclusion S — yC induces an isomorphism [like II.2(2)| Hom(S, P) = 
Hom(yC, P). 

This definition can be expressed diagrammatically, just as in the case 
of topological spaces, by requiring that for each object C of C and each 
cover S € J(C) the diagram 


P(C) >|] P(domf) ==] (dom) 


feEs fig fEs, (3) 
dom f=cod g 


is an equalizer of sets; here e is the map e(x) = {x- f}¢ = {P(f)(z)};, 
the second product ranges over all composable pairs f, g with f € S 
(hence also fg € S), the map p comes from composition in C, and a 
comes from the action by C on P—so if x = {xs} ses in] ] peg P(dom f), 


P(X)f.g=Zfg, AX)pg =2y-g. (4) 


If (3) is an equalizer for a particular cover S, i.e., if every family 
for this cover S has a unique amalgamation, we say that P satisfies the 
sheaf condition with respect to the cover S. 

Now, let K be a basis for a topology on a category C with pullbacks, 
and write J for the topology generated by K [as in (2) of §2]. The sheaves 
for J can then be described purely in terms of the basis K, as follows. 
If R= { fi: CG; 3 C |i € I} is a K-cover of an object C, a family of 
elements x; € P(C;) (i € I) is said to be matching for R iff 


Lsaj-m?, for alli, jE I, (5) 


1 


where 7! and 7? are the projections from the pullback, as in 
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“| [+ (6) 


An amalgamation for {x;} is an 2 € P(C) with the property that z- f; = 
vi for alli € I. 


Proposition 1. Let P be a presheaf on C, Then P is a sheaf for 
J iff for any cover { f;: C; ~ C |i € I} in the basis K, any matching 
family {x;}; has a unique amalgamation. 


Before giving the proof, we note that Proposition 1 can also be ex- 
pressed by an equalizer diagram: 


Proposition 1 [bis]. A presheaf P on C is a sheaf for J iff for any 
cover { fi: C; 2 C |t © I} € K(C) in the basis, the diagram 


Cc) ][ PG SPC; xo C;) (7) 


ier 


is an equalizer. Hee e(x) = {x- fii and pi({zi})i,j = i° jj, While 
po({xi})ij = 25-7 en where mj; and us . are the projections of the pullback 


(6). 


Proof: We leave it to the reader to verify that this second state- 
ment is just a reformulation of Proposition 1 above, and prove the first 
formulation. 

(=>) Suppose P is a sheaf for J, and let R= { f;: C; > C|ieI}e 
K(C) be a cover in the basis K. Let {x;}; be a matching family for this 
cover. Now consider the sieve 


k : 
SR =(g PSC ge Doo 4 some i and k} 
Senerated by R [cf. (2.4)], and define a matching family {y,} es by 
Yg = 2; -h, 

Where A is any map such that g = f; oh for some i. This does not 
depend on the choice of i and A, for if f;oh = g = f; ok, then using the 
Universal property of the pullback square (6), there is a map @ to the 
Pullback with 1}, 0 @ = h and 7,0 £ =k; so 


xi h = (a; - Tj ig) & = (ape i) = a;- k 
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(the middle one of these three equalities holds because {z;} is matching 
for R). Thus, there is a unique y € P(C) with y- 9 = yg for all g € (R) 
because P is a sheaf for J. In particular y- f; = ys, = xi, so y is also 
an amalgamation for {z;}. Moreover, an amalgamation for {zi }:e7 is 
unique, for if y’ is another one with y'- f; = x;, then for any g € S = (R), 
say g = fi; oh, we have y’-g = (y’- fi)-h= 2, -h = yg, so y’ is also an 
amalgamation for the cover (R), and therefore y’ = y. 

(<=) Suppose S' € J(C) is a cover of C’, so that there is an R € K(C) 
contained in S [cf. (2.2)], and let { yg | g € S} be a matching family for 
S. Then clearly the subfamily {ys | f € R} is also matching [in the 
sense for a basis, cf. (5) and (6)], so there is a unique y € P(C) with 
yf =yy for all f € R. It remains to show that y-g = yy for all g € S. 
To this end, take g € S, and consider for f € R all the pullback squares 


D XC ou PF.g9 Cc’ 


D—;—¢. 


Then {7;,, | f € R} © K(D) by the stability axiom for the basis, and 
moreover for any f € R, 


(y-9)- 9 = (yf) Pry (by the square) 


= us Pp (since f € R) 
= Yfops.g (since { yg | g € S} is matching) 
= Ygors, (by the square) 

= Ug" hig 


Now if f”: C” — C is also in R, pullback of a7, along zy 4 shows that 
Y°9°Ts,g matches y-g- 7,9. Since R’ = {as, | f € R} is a cover 
in K and the family { y- 9-7, | f € R} matching on R’ has a unique 
amalgamation, we must have y-g = yg, and the proof is complete. 


We have already seen many examples of sheaves in Chapter II, in 
the case where C = O(X) is the category of open subsets of a topo- 
logical space, equipped with the open cover topology. For instance, if 
Y is any other space, the continuous Y-valued functions from open sets 
in X constitute a sheaf on X—because matching continuous functions 
can be pieced together. This observation has a more general form, for 
any topological site T equipped with the open cover topology, as in §2, 
Example (b). In this case, for every Y, the representable presheaf 


y(Y) = T(-—,Y): T° — Sets 
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is, in fact, a sheaf (y: T > Sets™” is the Yoneda embedding) because 
if {U; | 7 © I} is an open cover of a space T in T, then any family 
of functions {a,;: U; — Y} which agree on intersections U; M U; clearly 
patch together to give a uniquely defined function a: T —- Y. 

For a differentiable manifold, the sheaf of germs of smooth functions 
plays a central role. Analogously, for algebraic geometry, one has the 
structure sheaf O on the Zariski site over a commutative ring k. Here the 
category is the category A which is the dual of the category (k — Alg)¢p 
of finitely presented k-algebras A, with their homomorphisms, while the 
Grothendieck topology is described by the basis in which the coverings 
of an algebra A are the duals of those families of the form 

{A— Alay") |i=1,...,n} (8) 
for which the ideal (a),...,@n) of A contains 1. 

The forgetful functor O: (k — Alg)s, — Sets which sends each al- 
gebra A to its underlying set A defines a presheaf on A. (It can also 
be described as a presheaf of commutative rings.) We claim that this 
presheaf O is actually a sheaf; it is called the structure sheaf for the 
Zariski site. To prove this, it is enough (by Proposition 1 [bis] above) to 
show for each cover (8) that the diagram 

A—> || Ala>") == TJ Al(aia;)“1] (9) 
t tg 
is an equalizer of sets (and hence of rings). Indeed, one has the isomor- 
phism 
A[(ajaj)~*] = Alay *] @4 Ala" 
in (k — Alg)s, which means that A[(a;a;)~'] is the pullback over A of 
Ala; "| and Ala;" in the opposite category. To see that (9) is indeed 
an equalizer, consider any elements x; € A[a;*], i= 1,...,n, such that 
<; = 2x; in A[(aja,)~*| for all indices i and j7. We may set x; = y,/a™ 
for some y; € A and all i, for m sufficiently large; then 2; = 2; in 
A[(a;a;)—1] means that 
ya (aja;)* = yal (aja;)* (10) 


for some sufficiently large k > 0. Now 1 € (a,,...,@,) in A; by raising 
the corresponding equation to the power n(m +k) we may write 1 = 
Ltamt* for suitable t;. Let z = > tiya*. Then 


aris = tiysag al t* 
i 
= a trate aky, by (10) 
i 


Sn Ree 
= ajyj- 
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Therefore, in Alay] one has x = aky;/art* = y;/aP = xj. This shows 
that x is an amalgamation for the 2,’s. 

It remains to prove that this amalgamation is unique, i.e., that the 
map e: A — J] A{a;"] of (9) is mono. To this end, suppose e(x) = 0 
for some x € A; in other words, that z = 0 in A{a;"| for each i. Then 
for a sufficiently large m, we have af’x = 0 in A for each i. But since 
1 € (a),...,@,), one can write 1 = 5° s;a7", and hence z = }* s,ay"z = 0 
in A, Thus, e is mono, and the proof that O is a sheaf is complete. 

It can be shown that O is a local k-algebra object in the category 
of sheaves on the Zariski site and that O is the universal such object in 
some appropriate sense, to be treated in Chapter VIII, 86. 

Notice that © is, in fact, isomorphic to the representable presheaf 
Hom(k(z], — ) corresponding to the k-algebra k[z]; so this example of a 
sheaf on the Zariski site is similar to the preceding example concerning 
the site T. It can be proved that for any finitely presented k-algebra, 
the corresponding representable presheaf is a sheaf on the Zariski site 
(Exercise 7(a)). 

More generally, if J is a (basis for a) topology on a category C such 
that all representable presheaves on C are sheaves, we will call the topol- 
ogy J subcanonical. The canonical topology is the largest subcanonical 
topology on C. 

The topology on T given by open surjections [§2, Example (c)] is 
subcanonical: indeed, if { f;: ¥; + X} is a basic cover so that the 
induced map f: [| Y; — X is an open surjection, then a matching 
family of continuous functions g;: Y; — Z into some other space clearly 
amalgamate to a unique function g: X — Z such that go f; = gq; for all 7; 
however, the point is to show that g is continuous. Now for an open set 
UC Z, 91 (U) = ff-'g71(U) since f is surjective; so since f is open, 
it is enough to show that f~1g—1(U) is open in [[Y;. But for each 
summand Y; C [] Yj, we have f-'g-1(U) NY, = f7'g-1(U) =97'(U), 
and this is an open set since g; is continuous. Since f~'g—'(U) NY, is 
open for each j, it follows that f—'g~!(U) is open. 

Let us consider the atomic topology on a category C as in §2, Exam- 
ple (f). Here sheaves can be described in the following simple fashion. 
(See also Exercise 13.) 


Lemma 2. A presheaf P is a sheaf for the atomic topology on C 
iff for any morphism f: D —- C and any y € P(D), ify-g =y-h for all 
diagrams 

E=3D—+.¢c 
with fg = fh, then y = x- f for a unique x € P(C). 

Because of this fact, one sometimes says informally that “every mor- 

phism f is a cover” in the atomic topology. 
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Proof: (=) Suppose y € P(D) has the property that y-g =y-h 
for all g, h with fg = fh. Consider the sieve S = (f) = {t | ¢ = 
fok for some k } generated by f, and define a matching family {x}: 
for S by x, = y-k where t = fok. This does not depend on the choice of 
k, by the hypothesis on y. Since P is a sheaf, there is a unique x € P(C) 
with x-t = 2x, for all t € S; in particular x- f = y. Since any other 2’ 
with 2’ - f = y would satisfy c’-t = x, for all t € S, x is also the unique 
one with x-f=y. 

(<=) Let S be a covering sieve of C, i.e., any nonempty sieve, and 
let {x¢}se5 be a matching family of elements of P. Fix fo € S. Clearly, 
fog = foh implies xs, -g = xy, -h, so we find a unique x € P(C) with 
2+ fo = Czy. It remains to show that x; = 2+ f for all f € S. So choose 
f € S, and consider some commutative square 


[such a square must exist in C, for otherwise the atomic topology cannot 
be defined; see §2, Example (f)|. Let y = 27,-v = z- fov € P(D). If 
g, h: E — D are two morphisms with ug = uh, then fovg = fovh, so 
yg = yh. Thus, by hypothesis, there is a unique z € P(A) with z-u = y. 
But xy for f € S is a matching family, so x» and x - f both satisfy this 
requirement; so x» = x: f. 


Let us return to the general context of a site (C, J). The sheaves on 
(C, J) form a category, where the maps are the natural transformations, 
i.e., maps of presheaves. So the category of sheaves, for which we write 
Sh(C, J), is a full subcategory of the functor category Sets©” 


Sh(C, J) — Sets@”. (11) 


Definition 3. A Grothendieck topos is a category which is equiva- 
lent to the category Sh(C, J) of sheaves on some site (C, J). 


Given such a topos, it is not always obvious how to find such a site 
(C, J). For instance, the category BG of continuous G-sets [Chapter I, 
§1, Example (xi)] is a Grothendieck topos, and we will construct a site 
for it in §9 below. In the appendix we will prove Giraud’s theorem, 
which will enable us to recognize Grothendieck topoi more easily. We 
also remark that given a Grothendieck topos G, a site (C, J) such that 
G = Sh(C, J) is by no means unique [cf. Chapter II, Exercise 4, and §9, 
Theorem 2, below]. 
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To conclude this section, we prove that limits of sheaves are sheaves. 
Recall that if I + Sets©”, i 4 P,, is a diagram of presheaves (7 any 
small category), the inverse limit in Sets” is computed pointwise, i.e., 


(lim P,) (C) = lim P,(C), (12) 


where on the right one just takes a limit of sets. 


Proposition 4. Let (C,J) be a site, and let I + Sets©” be a 
diagram of presheaves P;. If all P; are sheaves then so is lim P;. 


Proof: Let P = lim P; be the limit in the category Sets©”” of 
presheavyes, so P(C) = lim P;(C). If S is a cover of an object C’, then 
ras 
by assumption we have an equalizer 


P,(C) —> [J Pi(dom f) == [J Pi(dom g) 
fes fig 


of the form (3). Since limits commute with limits ((CWM], p. 227), 
taking the inverse limit of all these equalizers again gives an equalizer 
of the form (3), 


P(C) —> |] P(dom f) —= |] P(dom), 
fees 


so P is a sheaf. 


5. The Associated Sheaf Functor 


In this section, C is a small category, and J is a Grothendieck topol- 
ogy on C. As in §4, Sh(C, J) denotes the full subcategory of Sets©” 
consisting of J-sheaves. Our aim is to prove the following result, which 
extends a basic result on sheaves for topological spaces (Chapter I). 


Theorem 1. The inclusion functor i: Sh(C,J)—Sets©” has a 
left adjoint 
a: Sets©” — Sh(C, J), 


called the associated sheaf functor. Moreover, this functor a commutes 
with finite limits. 


This last sentence of the theorem makes sense, because inverse limits 
exist in Sh(C, J), see Proposition 4.4. 

Let P be a presheaf on C, i.e., a functor C°? + Sets. The con- 
struction of a(P) proceeds in two steps: first we will construct a new 
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presheaf P+, which is not yet necessarily a sheaf, but is halfway there. 
Call a presheaf P separated if a matching family can have at most one 
amalgamation. In other words, P is separated if for any x, y € P(C) 
and any cover S € J(C/), x: f = y- f for all f € S implies x = y (here 
C is an arbitrary object of the category C). So separated presheaves 
satisfy the uniqueness condition occurring in the definition of a sheaf, 
but not necessarily the existence condition. We will show that for any 
presheaf P, the newly constructed presheaf P+ is separated, and that 
P* is a sheaf if P itself is already separated. Then a(P) is obtained by 
applying this plus-construction twice, as 


a(P) = (Pt)t. 


To understand the definition of Pt, notice that if P were to be a 
sheaf, then for any cover R of an object C of C, a matching family 
(z)ser of elements of P should actually represent a unique element 
of P(C). Moreover, still assuming P to be a sheaf, if S € J(C) isa 
refinement of R, ie., S C R, then the subfamily (zy) se5, which is a 
Matching family for the cover S, actually represents exactly the same 
element of P(C). 

Given an arbitrary presheaf P, we might therefore try to define a 
new presheaf Pt by 

a _.} 
P ae Match(R, P), (1) 
where Match(R, P) denotes the set of matching families for the cover R 
of C, and the colimit is taken over all covering sieves of C’, ordered by 
reverse inclusion. In other words, an element of P*(C) is an equivalence 
class of matching families 


x={2as|f:D->CER}, axpeEP(D), and azy-k=ayzx, (2) 


for all k: E —+ D, where two such families x = {xy | f € R} and 
y = {y, | 9 € S} are equivalent when there is a common refinement 
T CRNS with T € J(C) such that x; = yy for all f in T. 

The so-defined P+ has the structure of a presheaf, where the restric- 
tion map Pt+(C’) + P*(C’) for a morphism h: C’ — C in C is given 
by 


{ey|fER}-h={anyp | f Eh*R}; (3) 


observe that this is well-defined on equivalence classes. This construction 
P \. P+ is a functor of P, because each map @: P — Q of presheaves 
(ie., each natural transformation ¢) induces in the evident way a map 
+: P+ —+ Qt. Moreover, there is a canonical map of presheaves 


n: P— Pt 
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defined for each x € P(C) as the equivalence class of the matching family 


no(a)={2-f| f etc}, (4) 


where tc is the maximal sieve on C’, as in §2. 

As an illustration, let us look at some instances of the plus- 
construction for presheaves on a topological space X (with the open 
cover topology). Take for example the presheaf B, where each B(U), for 
U open in X, is the set of all bounded continuous functions f: U > R. 
In general, B need not be a sheaf because if X has an open subset 
U which is not compact, there may be an infinite open cover of U on 
which a matching family, bounded on each set in the cover, may com- 
bine to give an unbounded continuous function. But consider the sheaf 
C' where C'(U) is the set of all continuous functions U — R; it is a sheaf 
and there is an evident map Bt(U) — C(U) because each matching 
family of bounded functions f;: U; —> R on a cover U, of U combine 
to give a uniquely determined continuous function f: U — R; equiv- 
alent matching families give the same f. Conversely, every continuous 
f: U — R arises in this way from a matching family of bounded func- 
tions f,: U, —> R, where U,, consists of all 2 € U with |f(z)| <n, and 
fn is the restriction of f to U,. Hence, in this case one application of 
the plus-construction turns the presheaf B into its associated sheaf C’. 

However, one application of the plus-construction will not always 
suffice. For consider any set S with more than one element, and the 
corresponding constant presheaf P on the space X, given by P(U) = 
S for every open set U, with all restriction maps the identity of S. 
Then Pt(U) = S if U # 0, while P*+(@) = 1 (for the empty cover, 
there is exactly one family, the empty one). Thus, P* is not a sheaf 
because if U; and U2 are disjoint open sets of X, then s; € P*+(U;) and 
sq € P*(U2) must have as a restriction to P+(@) the unique element 
of P+ (0), so {s1,s2} is a matching family for the cover {U,, U2} of 
U, U U2. However, if s; and so are different elements of S,, they clearly 
cannot have an amalgamation in P+. But, consider P+t+. An element 
of P*++(U) is an (equivalence class of) sets of elements s; € P(U,) for 
some open cover {U;} of U, which match (s; = s,) whenever the overlap 
U;NU; is nonempty. Thus these elements s; € S piece together to give a 
function s: U — S with the property that every point of U has an open 
neighborhood (e.g., some U;) on which the function s is constant. In 
other words, P++(U) is the set of all locally constant functions s: U — 
S. The P+* so described is evidently a sheaf; indeed, it is the associated 
sheaf of the constant presheaf P, as already constructed in Chapter II, 
Exercise 7. 

In general, the plus-construction P* has unique amalgamations only 
for matching families of elements coming from P, so one might imagine 
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that one would need to apply the construction some transfinite number 
of times to turn the presheaf P into a sheaf. But, as we will show, it 
suffices to apply it only twice, as in the last example above. 

As a first elementary property we need the following. 


Lemma 2. (i) A presheaf P is separated iff n: P — Pt is a 
monomorphism. (ii) A presheaf P is a sheaf iff ny: P —> P* is an iso- 
morphism. 


Proof: This is immediate from the definitions. For (i), take z and 
y € P(C), for some object C of C. Then n(x) = n(y) means that 
x: f=y-f for all f in some covering sieve S. This implies that x = y 
precisely when P is separated. The proof of (ii) is equally obvious. 


Lemma 3. If F is a sheaf and P isa presheaf, then any map ¢: P + 
F of presheaves factors uniquely through 7 as ¢ = @o0 7: 


¢ (5) 


Proof: An element of P+(C) is represented by a matching family 
{xs | f € R} of P for some cover R of C. Then for any h: D > C 
in R the definition (4) of 7 shows that np(z,) = {2,-k | k € tp}; on 
the other hand, {z; | fe R}-h={any | f’ € A*R}. But h*R is 
the maximal sieve tp since h € R; so since { xf | f € R} matches, one 
obtains an equality 


npo(tx)={ars|fER}-h, forh: D>CimR. 


Therefore, if the described map ¢ in (5) were to exist, o({z fl f ERS) 
must be the unique element y € F'(C) with 


yh= ({as| f €R})-h=Ol{zz| f © R}-h] = o(nv(2n)) = Gan) 


for all h € R. But such a (unique) y € F(C) indeed exists because F is 
a sheaf and { ¢(z,) | h € R} is a matching family of F. 


Lemma 4. For any presheaf P, P+ is a separated presheaf. 


Proof: To show P+ separated, we consider two elements x, y € 
P+(C) with x-h = y-h for all h in some cover Q of C; we must show 
that x = y. Now represent x and y as matching families x = {x fl 
f © R} and y = {yg | 9 € S} for covers R and S € J(C). The 
equality x-h = y-h for h: DC in Q means that there is some cover 
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Th © h*(R) NA*(S) of D such that 2,2 = ynz for all t €¢ T,. But by the 
transitivity axiom on covers, the family 


T={ht|heQ,teTn} 


is still a cover of C, and T C RNS; therefore x = y as elements of 
Pt(C). 


Lemma 5. If P is a separated presheaf, then P+ is a sheaf. 


Proof: To show P+ a sheaf, we must amalgamate any matching 
family of elements of P*+; such a matching family is given by a cover 
R € J(C) and elements {x; | f € R} which match; here x7 € Pt(D) 
when f: D — C. Now each x, is itself given as the equivalence class of 
a matching family for P, 


xp={rtpg|gESs}, g:E->D, x5 € P(E) (6) 


for some cover Sy € J(D). Also, the requirement that the x match for 
f: D — C means that for any morphism h: D’ + D one has xs-h = xfn 
as elements of P*+(D’). Using the definition (3) of “— -h”, this means 
that there is an equivalence of families 


{rpng |g €h*(Ss)} ~ {rpng 19 € Spr}, 


and this equivalence, in turn, means that there is a cover Ty, € A*(Ss)N 
Sth of D’ such that 


Lf,hg"! = Lfoh,g" for all g” € Ty: (7) 


Now let @ be the sieve {fog| f € R,g € S;}. Here both R and 
Sz are covers, so by the transitivity axiom Q covers C’. Then define a 
family y € Pt (C) for this cover Q by setting 


Yfog = Tyg. (8) 


We must show that this definition is independent of the choice of the 
factorization of f og. So suppose fg = f’g’ for morphisms f, f’ € A 
and g € Sy, g’ € Sy. Then if k € Tyg Tyg one has 


Leg *k=<XF ok (since xf is a matching family) 
= Zfog,k [by (7)] 
= ZL flog’k (by assumption) 
= Lyf g'k [again by (7)] 


=Ipigk (since xy, is also matching). 
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Since P is separated, by assumption, and since Ty, 1 Ty:,g is a cover 
of D’, we conclude that x, = x9. So y is well-defined by (8). Then 
because the xf are matching families, it follows that y = {y, | h «© Q} 
is a matching family; hence an element of Pt (C). 

Finally we claim that this element y is an amalgamation of the given 
matching family { x, | f ¢ R}. For this we must prove for each f: D 
C in R that 


y f={ypon|hef*Q} and xz={x4,9|9 € Ss } 


represent the same element of P*+(D). This indeed holds since S; € 
f*(Q) by the definition of Q and for any g € Sy, yfog = Tf,g by the very 
definition (8) of y. This proves the existence of an amalgamation. But 
since P+ is separated by Lemma 4, this amalgamation is unique. This 
completes the proof that P+ is a sheaf. 

With these lemmas we now deduce Theorem 1 above. Take the 
associated sheaf functor a to be a(P) = (P+)t. This defines a functor 
of the presheaf P and yields a sheaf by Lemmas 4 and 5. The map 
defined in (4) gives a composite map 


Pp —2P_, p+ pt) ptt (9) 


from P to the sheaf a(P). By two applications of Lemma 3, this compos- 
ite is universal among maps of the presheaf P to a sheaf F. Therefore, 
a is the required left adjoint to the inclusion of sheaves in presheaves, 
and the composite (9) is the unit of the adjunction. 

Note that if P is already separated only one step is needed in (9), 
while if P is a sheaf the unit (9) is an isomorphism, by Lemma 2. In 
other words, 


Corollary 6. The composite, inclusion followed by associated sheaf, 
aoi: Sh(C, J) > Sh(C, J), 


is naturally isomorphic to the identity functor. 


Notice that this is also a consequence of the general fact that the 
Tight adjoint i (as the inclusion of presheaves in sheaves) is full and 
faithful (see [CWM, p. 88]). 

It remains to show that the associated sheaf functor a preserves finite 
limits, as stated in Theorem 1. For this it is enough to show that the 
plus-construction P + Pt preserves finite limits. First we observe that 
for a fixed object C in C and a fixed covering sieve R € J(C), the functor 
P ++ Matcho(R, P) from presheaves P to Sets does preserve all limits; 
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in other words, for any functor P: I +> Sets©”” from an index category 
I to presheaves one has 


Matcho(R, lim P;) & lim Matcho(R, P;). (10) 


Indeed, as stated for the diagram below (4.2), we have for any presheaf 
P a natural isomorphism 


Matcho(R, P) = Homg,,,c°r (R, P). 


But clearly the functor Hom(R, — ) preserve limits. 

Finally, P*+(C) is defined in (1) as the colimit of Match over all 
covers of C’, ordered by reverse inclusion. This partial order is filtering 
(any two covers have a common refinement, their intersection). A well- 
known result ((CWM, p. 211]; see also VII.6, Corollary 5) asserts that 
filtered colimits commute with finite limits. Therefore, P + P+ does 
preserve finite limits, as was to be shown. 


6. First Properties of the Category of Sheaves 


In this section and the next, we will derive some basic properties of 
the category of sheaves on a site (C, J). In particular, we will see that 
the category of sheaves enjoys all the properties of our typical categories 
discussed in the preceding chapters; or to put it more briefly, that the 
category of sheaves on a site is an elementary topos (cf. §1.6). 

So from now on, we consider a fixed small category C equipped with 
a Grothendieck topology J, and we write 


Sh(C, J) —— Sets°” (1) 


for the adjoint pair provided by the sheafification process described in 
the previous section. 

We recall from (24) of the Preliminaries that arbitrary (small) limits 
exist in Sets©” , and that they are computed pointwise; one may express 
this by the formula 


(lim P,)(C) = lim P,(C); (2) 


here the left-hand limit is taken in Sets©”, while the right-hand one 
is in Sets. In §4, Proposition 4, we proved that Sh(C, J) ¢ Sets©”” is 
closed under limits. So Sh(C, J) has all small limits, and these limits 
are manifestly preserved by the inclusion functor i; this also follows from 
the fact that 7 has a left adjoint. 
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In particular, the empty limit—which is the terminal object 1 € 
Sets©” defined by 1(C) = {0} for all C—is a sheaf. The corresponding 
Hom-functor (for F' a sheaf) 


T: Sh(C,J) Sets, I'(F) = Hom(1,F) (3) 


[Hom in the category Sh(C, J)] is called the global sections functor. Re- 
call from §1.6 that the global sections functor on presheaves 


T: Sets” — Sets 
has a left adjoint A defined for each set S and each object C' of C by 
A(S)(C) = S; 


thus, for a set S, A(S) is the constant presheaf on C with value S. It 
now follows from Theorem 5.1 that [': Sh(C, J) — Sets also has a left 
adjoint, namely, the composite ao A: Sets — Sh(C, J). In the sequel, 
we will often abuse the notation and, instead of ao A, just write A for 
this left adjoint: 

A: Sets — Sh(C, J), (4) 


calling A(S) the constant sheaf associated to S. (See Exercise 14 for 
some examples.) 
A map ¢: F — G of sheaves is (by definition) just a map from F to 
G as presheaves. Because of the adjunction i 4 a of Theorem 5.1, ¢ is 
a monomorphism of sheaves iff it is a monomorphism of presheaves. In 
other words, the inclusion 7 both preserves and reflects monomorphisms. 
But a mono of presheaves is just a map which is a pointwise injection, 
so 
g: F — Gis mono in Sh(C, J) iff 5 
éc: F(C) — G(C) is an injection for all C € C. (5) 
The analogous statement for epis holds for presheaves; i.e., ¢: P — Q is 
epi in Sets©”” iff each P(C) > Q(C) is surjective. However, the similar 
statement concerning epis for sheaves fails (cf. Corollary 7.5, below). 
Another consequence of the adjunction of Theorem 5.1 is that all 
small colimits exist in Sh(C,J). The recipe for their computation is 
simply this: first compute the colimit in the category of presheaves, 
and then take the associated sheaf of the resulting presheaf, using the 
principle that left adjoints preserve colimits. Thus, for sheaves F;, 


lim F; © a(limi(F})), (6) 
where the colimit on the left is taken in Sh(C, J) and the one on the right 


in Sets". Recall that colimits of presheaves are computed pointwise, 
So there is a formula 


(lim P;)(C)=limP(C),  -P; € Sets”, (7) 
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analogous to the formula (2) above for limits. 

Our next purpose is to show that Sh(C, J) has exponentials. Let us 
remark first that if exponentials in Sh(C, J) were to exist, then they are 
necessarily constructed in the same way as are exponentials of presheaves 
(cf. 81.6). Or more precisely, if F and G are sheaves and the exponential 
GF exists in Sh(C, J) then 


GF) za. (8) 


This follows from the Yoneda lemma and the following sequence of bi- 
jective correspondences, natural in an arbitrary presheaf P € Sets©”: 


P—+i(G*) 
a(P) ——+ GF 
a(P) x F—+G 
a(P x i(F)) >G 
P xi(F) > i(G) 
P— ia)”, 


These follow from the adjunction a + i, together with the fact that 
aoi & id and a preserves products (Corollary 5.6 and Theorem 5.1), so 
that a(P) x F = a(P) x ai(F) = a(P x i(F)). 


Proposition 1. Let P, F € Sets©”” be presheaves. If F' is a sheaf, 
then so is the (presheaf) exponential FP. 


Proof: Recall from (1.6.(5)) that the elements of the presheaf ex- 
ponential F?(C) are the natural transformations 7: y(C) x P > F, 
where y denotes the Yoneda embedding C — Sets©’. Thus, 7 assigns 
to any morphism g: D — C' and any element x € P(D) an element 
7(g,x) € F(D). Naturality of 7 means that for each h: E — D one has 


T(gh, xh) = r(g,x)h. (9) 


The fact that F’? is a functor of C is expressed, for any arrow f: C’ > C, 
by 

(r-f)(g',z)=7(f9',2), (9g: D+C',ceEP(D)). — (10) 

First we show that the presheaf F? thus described is separated, if 


F is separated. Suppose 7 and o are two such natural transformations, 
while S € J(C) is some cover of C such that r-f = o-f forall f: C’ ~C 
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in S. This means that 7(fg’,x) = o(fg’,x) for all g’ and = as in (10); 
in particular (take g’ = 1) that 


T(f,) = o(f, 2) (11) 


for all f: C’ - C in the cover S and all « € P(C’). Now let k: C’ + C 
be any arrow to C while x € P(C’). Then for every g’ € k*(S) (ie., for 
kg’ € S) one has 


r(k, 2) +9’ = r(kg’, rq’) [naturality of 7, (9)| 


=o(kg',xg') [by (11)] 
= a(k,2)-g' (naturality of o). 


Then since k*S is a cover of C’ and F is assumed to be separated, 
it follows that 7(k,z) = o(k,z). Since k and x were arbitrary, one 
concludes that 7 = o. This shows that F? is a separated presheaf if F 
is. 

To prove that this exponential F’? is a sheaf if F is, we now need only 
show that amalgamations of matching families of elements of F? exist, 
since the uniqueness of any amalgamation follows from the fact that 
FP is separated. So suppose that S € J(C) is a cover and that for all 
f: D— Cin S we are given a natural transformation 77: y(D)x P > F; 
moreover, suppose that these 7; form a matching family. The latter 
means that ty, = Ts +g for every morphism g: KE — D and hence, by 
the definition (10) of 7 - g, that 


T4q(h, x) = (rz -g)(h, x) = Tz(gh, x) (12) 


for all h: E’ — E and all x € P(E’). To find an amalgamation of the 
family { 77: f € S} we will first construct from the cover S a natural 
transformation 7’: y(C) x P — Ft so that, for all f: D > C in S, the 
diagram of functors and natural transformations 


y(D)x P— GF 
sn |» (13) 
y(C) x P—_ Ft 


commutes. Since F is a sheaf, the map 7p on the right is an isomorphism; 
therefore, any such 7’ provides an amalgamation (nr)~! 07’ of the given 
family {7;: f € S}. 

To define 7 at an object B of C, we must define 7/(k, x) for any 
k: B — C and any element z € P(B); we can set 


r'(k,v) = {ten(1,2-h) | hE k*S}, (14) 
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provided that the right-hand side is an element of F'+(B); that is, is a 
matching family of elements of F' for the cover k*(S) of B. To check this, 
choose h € k*S and consider any morphism m for which the composite 
hm is defined; then 


Trn(1,x-h)-m = ten(m, x - hm) (naturality of 7x1) 
= Tkhm(1, a hm), [by (12)], 


so the right-hand side of (14) does match for the cover k*S, and it is 
clear that 7’ thus defined is a natural transformation y(C) x P > Ft. 

Moreover, the diagram (13) of functors for the 7’ as defined in (14) 
commutes. For consider any f: D — C in S, so that f*S = tp—the 
maximal sieve on D. Then for any element (k,z) in y(D) x P, where 
k: B + D and z € P(B), say, one has 


(r' 0 (y(f)  1))(, @) = 7'(fk, x) = { ryen(1,0-h) | A} 


by the definition (14) of 7’, where h ranges over (fk)*S = k* f*S = tp, 
the maximal sieve on B. On the other hand, by the definition §5(4) of 
n and by (12), 


nets(k, ©) = ne(rpe(1,2)) = { Thea(1,2-h) |b € te}; 


so the results agree. This shows that (13) commutes, and thus completes 
the proof of the proposition. 


As an example, let A be a complete Heyting algebra [§2, Exam- 
ple (d)|, while F and G are sheaves on A. If a € A, then the ideal | (a) 
generated by a is the set {b € A | b < a}. Considered as a category, 
this ideal has a Grothendieck topology inherited from A, in which “cov- 
ers are sups”. Moreover, F' and G restrict to sheaves F'|a and Gla on 
{(a). The exponential G* is the sheaf on A with elements the natural 
transformations 

7: Hom(—,a)x F >G. 


Since Hom(b, a) = @ unless b < a, in which case it is the one-element set, 
such a 7 is the same thing as a family of functions 7: F(b) — G(b), one 
for each b < a, and natural in b. In other words, G” is the sheaf with 


GF (a) = Hom(Fla, Gla), (15) 


where Hom is the set of morphisms (natural transformations) in the 
category of sheaves on |(a). [Note also that |(a) is in fact itself a cHa; 
see Exercise 17.| Observe that this formula (15) agrees with the one 
(11.8.2) for sheaves on a topological space. 
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As another example, let T be the topological site considered in either 
example (b) or (c) of 82. Then any topological space X (not necessarily 
one in T) represents a sheaf p(X) on T, namely, 


p(X) = Cts(—, X), 


where “Cts” denotes the set of continuous functions. In case X is lo- 
cally compact while Y is another space, the function space Y* can be 
given the usual compact-open topology. With this topology, there is an 
obvious natural transformation 


pY*) pvr, 


which is often an isomorphism (e.g., when X has an open cover by 
elements from T; see Exercise 18 for details). 

To conclude this section, we briefly consider the Yoneda embedding 
in the context of sheaves. If the topology on C is subcanonical, every 
representable presheaf is a sheaf, so the Yoneda embedding y: C — 
Sets©” factors through the inclusion Sh(C, J ) + Sets©”. However, 
even if J is not subcanonical, we can compose y with the associated 
sheaf functor a to give a canonical functor 


ay: C — Sh(C, J), (16) 


which however need not be full and faithful. For any sheaf F on 
(C, J), regarded as a presheaf iF, the Yoneda lemma states that 
F(C) = Hom(y(C), iF). By the adjunction of Theorem 5.1, this gives 


F(C) = Homgic, 7) (ay(C), F). (17) 


Every presheaf is a colimit of representables (Proposition [.5.1). 
Consequently, any sheaf F, regarded as a presheaf iF, has the form 
uF) = lim y(Cx) for some diagram of objects C, in C. Therefore, 
Since a as a left adjoint preserves colimits, 


= ai(F) ¥ alim, y(Ce) = lim (ay(Ck)). 


This implies in particular that the set of all the sheaves ay(C’) for C € C 
generate the category Sh(C,J). (Recall from [CWM, p. 123] that a 
Collection {Gg} of objects of a category A is said to generate A iff for 
any parallel pair of morphisms f, g: A — B in A one has f = g iff 
ft = gt for all maps t: Gg — A with G¢ any object from the collection 


{Ge}.) 
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7. Subobject Classifiers for Sites 


Perhaps the most important property shared by our typical cate- 
gories from Chapter I is the existence in each such category of a sub- 
object classifier 9. We will now show that there is such a subobject 
classifier for sheaves on a site. First recall (§II.8) that for sheaves on 
a topological space with the usual open cover topology, the subobject 
classifier 2. was the sheaf 2 on X defined by 


Q(U) = {V | V is open and V c U}. 


If here we replace each open set V by the corresponding principal sieve 
L(V) ={V'| V’ CV}, then 2 becomes 


0(U) = { S'| S is a principal sieve on U }. (1) 


Now to assert that a sieve S is principal is to assert that it is closed 
under arbitrary unions of its elements; i.e., that for all open W C U, 


S coversW => WeS. (2) 


It is this property of sieves (to be called “S' is closed”) that we will 
generalize to any site. 

Consider an arbitrary site (C, J). For a sieve M on an object C' and 
an arrow f: D — C, recall that f € M iff f*M is the maximal sieve on 
D, and (§2) that “M covers f” means that f*M € J(D). Now define: 
A sieve M on C is closed (for J) iff for all arrows f: D > C in C, 


M covers f => feM,; (3) 
or, equivalently, iff for all arrows f: D + C, 
{h-E—D| fhe M} covers D => feM. (4) 


Indeed, the condition on the left-hand side of (4) states that f* M covers 
D, ie., M covers f. (Note: the terminology “closed” is unfortunately 
now standard in this usage, and was perhaps suggested by the general 
idea of a closure operator; it has, however, no intuitive connection with 
the basic notion of a closed set in point-set topology.) 

The property of being closed is stable under pullback, in the sense 
that for any sieve M on C' and any morphism h: B > C, 


M is closed => h*M is closed. (5) 


Indeed, suppose h*(M) covers a given morphism f: D — B. By def- 
inition, this means that M covers the composite hf; hence since M is 
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closed, hf € M, or f €h*M. Thus, h*(M) is closed. This means that 
the definition 


Q(C) = the set of closed sieves on C’ (6) 


yields a functor 2: C°P —+ Sets, for which the restriction NC — NB 
along any h: B — C is given by 


M-h=h*M, (7) 


for any closed sieve M on C. 
From any given sieve S on an object C, one may construct a related 
sieve S on C' as follows: 


S ={h|h has codomain C, and S covers h}. (8) 


Notice that 9 is indeed a sieve [cf. the stability condition (iia) of §2]. 
This sieve S is closed. For suppose S covers g. By definition of S, 
the sieve S covers all arrows in S. Hence, by the “arrow form” of the 
transitivity condition [(iiia) of §2], S covers g. Thus, g € S. 

It is obvious that S is the smallest closed sieve on C which contains 
S. Consequently, S is called the closure of S. This closure operation is 
natural, in the sense that for any arrow g: D - C, 


9° (S) = 9"(5). (9) 
Indeed, 9*(S) © g*(S) and the latter is closed by (5), so g*(S) C g*(S). 
Conversely, if f € g*(S) for some morphism f: B — D, then gf € S, 


ie., S covers gf, or equivalently g*(S) covers f. Thus, f € g*(S). 
Lemma 1. The presheaf 2 of (6) is a sheaf. 


Proof: We first show that the presheaf 2 is separated. So suppose 
M, N € Q(C) are two closed sieves on C, while S is a cover of C such 
that g*M = g*N for any g € S. Then MNS = NMS. Take any f € M. 
Then M covers f, and S covers f since S covers C, so MS covers f. 
But MNS = NAS CN, so N covers f, and therefore f €N since N 
is closed. This shows MC N. Repeating the argument with M and N 
interchanged gives M = N. Thus, 2. is indeed separated. 

It remains to show that matching families have amalgamations in 2. 
Let § € J(C) be a cover, and let My € Q(D), for f € S with f: D> C, 
form a matching family of closed sieves, so that 


9" (My) = Mfg (10) 
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for all f € S and all g composable with f. Consider the sieve 
M={fog|geM;s, feS}. (11) 


This M need not be closed (why?), but we claim that its closure M is 
the required amalgamation of the My. First, it follows from (10) that 


f°(M) = My (12) 


for any f € S. Indeed, it is clear that My C f*(M). And conversely, if 
u € f*(M), ie., fu € M, then for some f’ € S and g € My, fu= f'g. 
Thus, My, = Myrg, or by (10), u*(My) = g*(My). But g € My, so 
g* (My) is the maximal sieve; hence so is u* My, i.e., u € My. Now (12) 
and (9) give f*(M) = My = Mj, since M; is closed. This shows that 
M is an amalgamation for the My, (f € S), and completes the proof 
that © is a sheaf. 


Lemma 2. Let F be a sheaf on C, and let A C F be a subpresheaf 
of F. Then A is a sheaf iff for all C € C, x € F(C) and all covers S of 
C, it holds that x € A(C) whenever x- f € A(D) forall f: D>C in S. 


Proof: This is immediate, for the condition on A simply states that 
the amalgamation of a matching family of elements of A, which neces- 
sarily exists as a uniquely determined element of F', actually lies again 
in A. 

Observe that the maximal sieve on C, tc = { f | f has codomain C } 
is obviously closed, and that for any morphism g: D — C in C, we have 
g* (tc) = tp. Thus, C+ te defines a natural transformation 


true: 1 +. (13) 
Proposition 3. The sheaf 2 of all closed sieves, together with the 
map true of (13), is a subobject classifier for the category Sh(C, J). 


Proof: Let F' be a sheaf on C, and let A C F be a subsheaf. We 
propose a “characteristic function” 4: F —» 2 for A, defined in the 
same way as for presheaves: 


(xa)o(z) ={f: D>C|a- fe A(D)}, 


where C' € C and x € F(C). By Lemma 2 above for x - g, (ya)c(2) iS 
a closed sieve on C’. Moreover for any g: B — C in C, we have 


fe(xa)B(@-9)  iffe-g-feA 


iff gf € (xa)c(z) 
iff f € g*((xa)o(2)), 
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so XA is a natural transformation [cf. (7) above]. 
We verify now that the square of sheaves 


is a pullback. Since limits in Sh(C, J) are computed as limits in SetsC”, 
i.e., pointwise, this square is a pullback precisely when, for all C € C 
and all « € F(C), « € A(C) iff (yva)c(x) = tc, and this is indeed 
the case as is clear from the definition of x4. This last equivalence 
also shows that x is unique, for lc € (xa)c(2) iff « € A(C) implies 
by naturality of ya that for any f: D - C in C, f € (xa)c(2) iff 
Ip € f*(xa)o(2) = (xa) (a f), iff x - f € A(D). 


Corollary 4. Every Grothendieck topos is an elementary topos. 


Conversely, however, an elementary topos need not be a 
Grothendieck topos. In the Appendix, we will see that the following 
additional properties distinguish the Grothendieck topoi among the el- 
ementary ones: The existence of a set of generators, and the existence 
of all small coproducts. 

As an application, let us characterize epimorphisms of sheaves. 


Corollary 5. A morphism ¢: F - G of sheaves is an epimorphism 
in the Grothendieck topos Sh(C, J) iff for each object C of C and each 
element y € G(C), there is a cover S of C' such that for all f: D> C 
in S the element y- f is in the image of dp: F(D) — G(D). 


The conclusion is often phrased “¢ is locally surjective”. 


Proof: First suppose that ¢ is locally surjective and that a, 3: G > 
H are maps of sheaves with ag = Bd. For any object C of C and any 
y € G(C), pick a cover S as in the statement of the corollary. Then 
clearly for every f € 9 one has a(y- f) = B(y- f), or a(y)- f = B(y)- f. 
Since H is a sheaf and S a cover, a(y) = B(y). 

Conversely, given an epi ¢: F — G, define a presheaf A C G by 


A(C) = 
{y € G(C) | dacover Sof CVf: BC in S: y- f €Im(¢z)}. 


By Lemma 2, A is a subsheaf of G. Let y4: G — 1. be its characteristic 
map. Then ¢c takes F(C) into A(C), so the square 


F 1 
| [ie 
G Q 
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commutes. Thus, ¢ epi implies that y4 coincides with the composite 
true 
eae Q, so A = G and the corollary follows. 


In exactly the same way, we can derive a slightly stronger version of 
Corollary 5. Suppose P and Q are presheaves on C, while ¢: P — Q is 
a natural transformation. I¢ still makes sense to define ¢ to be locally 
surjective for the topology J iff the conclusion of Corollary 5 is satisfied. 
As before, a: Sets~” —> Sh(C, J) denotes the associated sheaf functor. 


Corollary 6. For given ¢: P - Q, the map a(¢): aP — aQ is an 
epimorphism in Sh(C, J) iff @ is locally surjective. 


Proof: (<=) Let a, 3: aQ — F be maps into a sheaf F, such that 
aoa(?) = Goa(¢). Then by the naturality of 7: P - aP, one also has 
acnog= Bonog: P > F. This implies that an = Sn in the case ¢ is 
locally surjective, by exactly the same argument as in Corollary 5. By 
the universality of 7, we conclude that a = 8. 

(=) Define a subpresheaf A C Q as in the proof of Corollary 5. 
Write 0) for the subobject classifier of Sets©”, and © for that of 
Sh(C, J). So N')(C) is the set of sieves on C, while 2(C) C N®)(C) is 
the set of closed sieves on C,, for any object C of C. Let x4: Q > 2) 
be the classifying map for A C Q; so for C € C and y € Q(C), 


(xa)o(y) ={f: D>Cly-f € A(D)}, 


as in 81.8. It follows readily from the definition of A that (xa)c(y) is 
always a closed sieve. So y, factors through i: 2 >). Since 2 is a 
sheaf, xa = 1X4 ° NQ for a unique ¥4: aQ > 2. Now clearly x40 ¢ 
factors through true: 1 — ©; hence, by universality and naturality of 7, 
so does ¥4 0 a(¢): aP — (Q. But a(¢) is epi; hence, ¥4 factors through 
true. Then so does y4; hence, A = Q, ie., ¢ is locally surjective. 

As a special case, we observe that the Grothendieck topology J on 
C can be recovered from the category Sh(C, J) of sheaves, together with 
the functor ay: C —» Sh(C, J). (Recall that a family { f;: C; + C} is 
said to cover C' if the sieve it generates is a covering sieve.) 


Corollary 7. A family { f;: C; > C} covers C iff the induced map 


[]av(c:) > ay(c) 


is an epimorphism in Sh(C, J). 


Proof: By construction of coproducts in Sh(C, J), the coproduct 
I] ay(C.) is the associated sheaf of []® y(C,), where ]]®) denotes the 
(pointwise) coproduct in the presheaf category Sets”. The statement 
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easily follows by applying the previous corollary to the case where ¢ is 
the map ]]” y(Ci) > y(C). 
This result will be used in the “semantics” to be considered in §VI.7. 


8. Subsheaves 

Let (C, J) be a fixed site and Sh(C, J) the associated Grothendieck 
topos, as in the previous section. For each J-sheaf E on C, let Sub(£) 
be the set of subobjects A of E in Sh(C, J). Thus, each A € Sub(£) 
can uniquely be represented by a functor 


A: CC? —> Sets, 


such that (i) for each object C' of C, the set inclusion A(C) € E(C) 
holds, (ii) for each morphism C’ — C' the restriction A(C) — A(C’) 
agrees with that of E, and (iii) for each object C' of C, each cover S of 
C, and each e € E(C) one has 


e- f € A(D) for every f: D— C in S implies e € A(C) (1) 


(cf. Lemma 7.2). 
The subobjects of E are partially ordered in the usual way, which in 
this case can be expressed as 


A<B_ iff A(C)CB(O), forall CEC, (2) 


for A, B € Sub(E) as above. Clearly, E itself satisfies condition (1), so 
this poset Sub(£) has a largest element. Moreover, if A and B are sub- 
functors of E which satisfy (1), then so does their pointwise intersection 
ANB: 

(AA B)(C) = A(C)N BC); (3) 
this defines the meet of A and B in Sub(£). In fact, this applies to any 


family of subobjects {A;} of E: the infimum (A, A; exists in Sub(£), 
and can be described pointwise; i-e., 


(A Ad(C) = (AKC). (4) 


It follows that Sub(E) is a complete lattice, since as always suprema can 
be described in terms of infima by /; A; = A{ B| A, C B for all i}. In 
the present case of Sh(C, J), suprema can also be described explicitly 
by the equivalence, for any C € C and any e € E(C): 


e € (\/ A;)(C) iff { f: D> C e+ f € Ax(D) for some i} € J(C), (5) 
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where {A;} is any family of subobjects of E. To see this, first no- 
tice that the right-hand side of (5) describes a subfunctor of FE; for if 
S={f:D—-C|e-f € A,(D) for some i} covers C, then for any 
morphism g: C’ — C, the sieve g*(S) covers C’. But 9*(S) ={h: D> 
C’ | (e-g)-h € Aj(D), for some i}, so e-g again satisfies the right-hand 
side of (5). That this indeed defines the sup of the A; is now clear, be- 
cause the subfunctor of FE defined by the right-hand side of (5) evidently 
satisfies condition (1) above (by the transitivity axiom for Grothendieck 
topologies) and is the smallest such containing all the A;. 


Proposition 1. For any sheaf E on a site (C, J), the lattice Sub(£) 
of all subsheaves of E is a complete Heyting algebra. 


Proof: We have just noticed that sups and infs exist, so it suffices 
to prove the distributive law: that for a family of subobjects {A;} and 
another subobject B, 


BAV A=\/ BAA; (6) 


holds [see §2, Example (d)]. Here the inclusion > always holds. To 
prove the inclusion C, take e € E(C) for any object C € C. Suppose 
that e € B(C), and moreover that e € \/; Ai, so that the sieve S = 
{f:D—C|e-f € A;(D) for some i} covers C [cf. (5) above]. Then 
for f € S,e- f € (BA A;)(D) for some i by (3), so clearly the same 
cover S shows that e € (VV; BA A;)(C), by (5). 

The complete distributive lattice Sub(Z) is thus Heyting, and so has 
an implication operator. We claim that this operator > on Sub(£) can 
be explicitly described by 


e€(A> B)(C) 
iff for all f: D— C,e- f € A(D) impliese- f € B(D), (7) 


for any two given A, B € Sub(£), any C € C and any e € E(C). To 
prove (7), we temporarily interpret (7) as defining (A = B)(C) C E(C) 
for each C' € C. Then this A = B is clearly a subfunctor of E [cf. (ii) 
above], and by the stability axiom for Grothendieck topologies it follows 
that A = B also satisfies condition (1). Thus A => B as defined by (7) 
is indeed a subobject of E. To prove that it describes the implication 
operation in Sub(£), it now suffices to verify that A = B as defined by 
(7) enjoys the property 


UC(ASB) if UAACB forallUeSub(E), (8) 


which characterises the implication in a Heyting algebra (cf. §1.8). It is 
clear that (8) follows from (7). 
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Any morphism ¢: E — F of sheaves induces a functor on the corre- 
sponding partially ordered sets of subsheaves, 


¢ ': Sub(F) — Sub(£) (9) 
by pullback: for B C F and C' € C, that is, by 
eed '(B)(C) _ iff dc(e) € B(C) (10) 


for alle € E(C). Clearly ¢~ is order-preserving, i.e., is indeed a functor 
of partially ordered sets. We claim that ¢~! has both a left and a right 
adjoint. The left adjoint, which is usually written as 


dg: Sub(£) — Sub(F), (11) 

should, by the definition of adjoints for posets, satisfy 
4g(A) C B iff AC ¢ 1(B). (12) 
For A C E£, define a proposed 34(A) by setting, for C € C and y € E(C), 


y € 4g(A)(C) 
iff { f: D-+C | 5a € A(D), ép(a) = y- f} is acover of C. (13) 


(Note the existential quantifier 4a.) In other words, we define 44(A) by 
taking the pointwise image of each composite A(C’) > E(C) - F(C), 
and then closing off under (1). Leaving the straightforward details to the 
reader, we remark that this 3g(A) € F is a subfunctor by the stability 
axiom for Grothendieck topologies, and in fact is a subsheaf [cf. (1)] by 
the transitivity axiom. To prove that (12) holds, consider any subsheaf 
BC F. Suppose that 3,(A) ¢ B. Then for any C € C and a € A(C), 
one has ¢¢(a) € 4g(A)(C) € B(C); that is, a € 651(B) = 671(B)(C). 
Thus, A C ¢—1(B). Conversely, suppose A C ¢~1(B), and consider any 
C eC and y € 34(A)(C). By the definition (13), the latter means that 
there exists a cover S of C such that for each f: D — Cin S, y-f isin the 
image of gp: A(D) — F(D). But by assumption A(D) C ¢7'(B(D)), 
80 the image of ¢p is contained in B(D). Therefore y- f € B(D) for 
each f € S. Since B is a subsheaf, this implies that y € B(C). This 
shows that 44(A) C B, and completes the proof of (12). 

Next, we construct for ¢: E — F the right adjoint of ¢, usually 
denoted by 

Vo: Sub(£) - Sub(F). 


For A C E, and y € F(C) for an object C of C, one might try the 
definition 
y € V4,(A)(C) iff for all x € E(C), ¢c(z) = y implies x € A(C) 
(14) 
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[that is, 6>'(y) C A(C)]. This is in effect exactly the definition of the 
universal quantifer V4 for subsets of a set, as presented in Theorem 1.9.2. 
However, the proposed definition (14) does not give a presheaf. Hence we 
“stabilize” under the action of C on F’, and so define V4(A) by vskip-3pt 


y €V4(A)(C) iff for all f: DC, y- f €V4,(A)(D) 


“A (15) 
iff for all f: DC, $5 (y: f) € A(P). 


Then for any arrow g: C’ > C, y € Vg(A)(C) clearly implies y- 9 € 
V¢(A)(C’), so (15) does define a subfunctor of F. 

Now we show that this subfunctor is actually a subsheaf. To this end, 
consider some y € F(C) and a cover S of C such that y-g € Vg(A)(C’) 
for all g: C’ > C in S. We must show that y € Vg(A)(C), ie, that 
for all f: D > C, ¢5'(y- f) G A(D). So suppose y- f = ¢p(z) for 
some z € E(D). Now f*(5S) is a cover of D, and if h: C’ 3 D is 
any arrow in f*(S), then fh € S so y- (fh) € Vg(A)(C’). Thus, since 
gon(z-h) = y- (fh), we have c-h € A(C’). Since this holds for all 
h € f*(S) and A is a subsheaf of E, we conclude that  € A(D). This 
proves that ¢,'(y- f) C A(D), as was to be shown. 


Next, we prove that Vy as defined by (15) is indeed right adjoint to 
¢@ 1; i.e., that for any subsheaves A C E and BC F, 


@ \(B)CA iff BC V4(A). (16) 
In one direction, suppose ¢~1(B) € A, and consider any C' € C 
and b € B(C). Let f: D — C be a morphism in C and let x € E(D) 
be such that @p(z) = b- f. Then @p(zx) € B(D), so z € A(D) since 
¢ 1(B) C A. Thus, b € Vg(A)(C). The converse implication in (16) is 
even easier. 
We have now proved the following. 


Proposition 2. For any morphism of sheaves ¢: E — F on a site, 
the pullback functor 6~!: Sub(F) > Sub(£) has both a left and a right 
adjoint: 3 

asics Svan 
-1 
Sub(Z) —2——-Sub(F), 3g 1¢71 49. 


Veo 4 


We emphasize that it is not only the existence of these adjoints, but 
also their explicit description as given by (13) and (15), which will be of 
importance later on. 

The adjoints of Proposition 2 are natural in the sense expressed in 
Exercise 15. 

To conclude this section, let us briefly consider some examples. Let A 
be a complete Heyting algebra, with its natural Grothendieck topology 
[§2, Example (d)], and consider the topos Sh(A). A subsheaf of the 
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terminal object 1 is a functor S: A°? — Sets such that S(a) C {0} for 
every a € A, and, moreover, such that a = \/ a; implies that 


if 0 € S(a;) for all i then 0 € S(a) 


[cf. (1) above]. So S is completely determined by the element s = \/{ a | 
0 € S(a)} of A, and we find that s + S' gives 


A & Sub(1). (17) 


This shows that any complete Heyting algebra can be realized as a sub- 
object lattice in a Grothendieck topos. 

Before discussing the next example, consider for a moment an arbi- 
trary site (C, J). If E is a sheaf on C, the minimal element 0 € Sub(£) 
is not necessarily the empty functor because this functor need not be a 
sheaf. This happens because it may be that an object C’ of C has an 
empty cover; i.e., @ € J(C). Now when @ is a cover of C, a matching 
family of elements of E for the empty cover is by definition a function 
on the empty set; there is only one such function, and it is obviously 
matching. However, in the empty functor C°? — Sets, there is no el- 
ement to be an amalgamation so this functor is not a sheaf. However, 
for a given sheaf E on Sh(C, J), 6 € J(C) implies, by the existence of 
a unique amalgamation, that there is exactly one element z € E(C); so 
the subfunctor 0 defined by 


OC) ={eeH(C)}, 0€I(C) Ags 
=, 6¢ J(C) 
is a sheaf, and is clearly the smallest subsheaf of E (see also Exercise 13). 
Now consider any subsheaf B of a given sheaf E. By definition, its 
“pseudo-complement” (i.e., its negation) ~B is the largest subsheaf U 
of E with U A B = 0; i.e., with 0 as in (18), 


AB = \/{U € Sub(Z) |U AB =0} 
=B>0 


[cf. §1.8, (8) and (9)]. From (7) and (18), we see that ~B can be explicitly 
described by the equivalence, for any C’ € C and x € E(C), 


t € ~B(C) iff for any f: DC, x- f € B(D) implies that @ € J(D). 
(19) 

Now consider as a special case the dense topology of §2, Example (e). 
The empty family can never be dense, so 0 is, in fact, the empty functor 
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in this case. Thus, if B is a subsheaf of a sheaf E, the subsheaf ~B can 
simply be described for any object C' of C by 


>~B(C) = {x € E(C)| for no f: D>C,«- f € B(D)}. (20) 
Now let « € E(C) be any element of E, and write 
S,={f:D3C|«-feB(D)orz- f ¢-B(D)}. 


Then S, is dense below C, for if g: C’ — C is any morphism, then 
either some restriction of x -g ends up in B |say (2- g)-h € B(C”) for 
some h: C” — C"], in which case gh € S,, or none does (in which case 
g € S,). This shows that 


BV-B=E, (21) 


so Sub(£) is, in fact, a complete Boolean algebra. 

Finally, consider the case of the atomic topology J on a category C, 
as in §2, Example (f). The atomic topology is a special case of the dense 
topology, so again Sub(£) is a complete Boolean algebra for any sheaf 
E on C. Let « € E(C) be some element of a fixed sheaf E. We wish 
to show that there exists a smallest, subsheaf A C E with « € A(C). 
Clearly, such an A must contain all restrictions of 7; i.e., 


z-f €A(D) forall f: DOC. 


Therefore, since every nonempty sieve is a cover, property (1) from the 
beginning of this section gives 


y € A(D) if there are morphisms C - E- DinC with y:g=a-f. 

(22) 
But taking (22) as a definition of A, for a fixed element x € E(C), one 
can easily show that A is, in fact, a subsheaf of E (using the property 
expressed by (10) of §2). This subsheaf A is therefore an atom of Sub(£); 
i.e., it contains no smaller nonzero subsheaf. The argument above shows 
that any B € Sub(£) with B # 0 contains such an atom A. This proves 
that Sub(£) is in fact an atomic complete Boolean algebra. (This latter 
property actually characterizes the atomic topology.) 


9. Continuous Group Actions 


We consider the category BG of continuous G-sets, where G is 4 
topological group. Recall from §I.1 that the objects of BG are sets X 
equipped with a right G-action 


XG aX (1) 
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which is continuous when X is given the discrete topology, and that the 
morphisms of BG are functions which preserve this action. We write 


Homg(X,Y) 


for the set of morphisms X — Y in BG. 
If G acts on X and z € X, the isotropy subgroup of z, 


is open for each x precisely when the action map (1) is continuous, as is 
easily verified (see the exercises of Chapter I). 

To show that BG is a Grothendieck topos, we will explicitly con- 
struct a site for BG. Let S(G) be the full subcategory of BG whose 
objects are right G-sets of the form 


G/U, 


where U is an open subgroup of G (if U is open, then the quotient 
topology on G/U is discrete). So the elements of G/U are right. cosets 
Uz (x € G), while G acts on the right on each coset by 


(Uz)-g =Uzg. 


The isotropy subgroups are the conjugates of U: Iyz = {g | Urg = 


Uz}=27!Uz. A morphism G/U = G/V in S(G) has to preserve the 
action, ie, o(Uzg) = ¢(Ux)g, so ¢ is completely determined by what it 
does on the coset U = Ue (e being the unit of G). On the other hand, 
if a is any element of G we may try to define a morphism 


¢: G/U > G/V 


in BG by the formula ¢(Uz) = Vaz. This is well-defined on cosets if 
Ux = Uy implies Vax = Vay, or equivalently, if U C a~1Va. Putting 
these observations together, we conclude that morphisms G/U — G/V 
in S(G) correspond to cosets Va with the property that U C a7!Va. 
If U C a-'Va for a particular a € G, the corresponding morphism of 
S(G) is denoted by G/U Bae /V. In this notation, composition in S(G) 
Corresponds to multiplication in G. 

Notice that all morphisms in S(G) are epimorphisms in BG. This 
Suggests that we equip S(G) with the atomic topology, i.e., the topology 
in which every nonempty sieve is a cover, as in §2, Example (f). This 
Satisfies condition (10) there, and so is indeed a Grothendieck topology, 
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b 
for given two morphisms G/W — G/V -G /U, we may complete them 
to a commutative square 


CO eu 


by choosing an open subgroup O small enough for the indicated maps 
to be well-defined, i.e, O C aVa-1 NbWd-?. 
There is a canonical functor 


¢: BG > Sets", g( X) = Homg(-, X) (3) 
induced by the inclusion S(G)—- BG. Note that 
(X)(G/U) = Home(G/U, X) 
axe. (4) 


where XU = {x € X | Vg € U(zg = z) } is not an exponential, but is 
the set of U-fixed points, as usual. In terms of these fixed point sets, 
(-)-a 
¢(X) sends a morphism G/V —s G/U to the function XY ——-+ XY; 
the action by a indeed maps XY into XY because I,., = a~!Iza, so 
ifz € XY, ie, U C Ig, then Ip.g D a 1Ua D V by definition of the 


morphisms in S(G); ie., z-a € XY”. 


@ 
Theorem 1. For any topological group G, the functor BG — 
SetsS(O)"" induces an equivalence of categories 


BG & Sh(S(G)), 


where on the right the sheaves are taken with respect to the atomic 
topology. 


Proof: Define a functor 
ap: Sets” _, BG 
by setting, for a presheaf F on S(G), 
v(F) = lim, F(G/U), (5) 


where the colimit is taken over all open subgroups U of G, partially 
ordered by inclusion. So the elements of 7(F) are equivalence classes 
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[x, U] where x € F(G/U), and [z,U] = [y, V] whenever there is an open 
subgroup W C UNV such that x and y have the same image under the 


maps F(G/W — G/U) and F(G/W —> G/V), with e the unit element 
of G. The group G acts on the set #(£) by 
(x, U] ‘g= [y,g~ Ug], 


where y = F(G/g71Ug 4.6 /U)(z). This action is indeed continuous 
since clearly [z,U]-g = |x, U] if g € U, and is well-defined on equivalence 
classes. Also is a functor; for ifr: F' — F’ is a morphism of presheaves 
in SetsS()"” that is, a natural transformation, then the components of 
7 induce a map (7) on the colimit (5) in the obvious way: 


%(7)|z,U] = [rev (2), U). 

Note that if X is a continuous G-set, then X = Uy XY since all 
isotropy subgroups are open, and since XU & ¢(X)(G/U) by (4), this 
means precisely that there is a natural isomorphism 

pod HI1d. 

On the other hand, we just noticed that the isotropy subgroup of [x, U] € 
#(F) contains U, so that [z,U] € ¥(F)" and we obtain, by (4), a map 
a(F)y = ay: F(G/U) > $(F)" = ob(F)(G/U) 
oy (x) = [z,U], z€ F(G/U), 
for each open subgroup U. This in fact defines a natural transformation 


ee op(F); for if G/U Se /V is a morphism in S(G), commutativity 
for a € G of the diagram 


F(G/U) 25 o(F)” 


ro | —)-a 


F(G/V) az WF)” 


simply means that for y € F(G/V), av(F(a)(y)) = [F(a)(y), U] defines 


the same element of #(F’) as [F(G/a-1Va a G/V)(y),a~1Val], and this 
follows from the commutativity of 


G/a-'Va —*-+ G/V 


|< 


G/U 
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[recall that U C a~!Va by definition of the morphisms in S(G)]. The 
definition a = ap: F > $y(F) is also natural in F, as is easily checked. 

To complete the proof, we show that a is an isomorphism in the case 
where F' is a sheaf for the atomic topology. If F is a sheaf, all maps 
F(G/U) > F(G/V) coming from V C U involved in the colimit (5) are 
monos, so that the canonical maps (the a above with larger codomain) 


ay: F(G/U) > lim, F(G/V) = 4(F) 


are also monomorphisms. The image of ay is contained in w(F), as 
already observed, so we are done if we prove that every element of p(F)¥ 
occurs in the image of ay. Pick any [z,V] € ¥(F)U. We may choose 
V small enough so that V C U. We claim that « € F(G/V) defines 
a matching “family” for the singleton cover G/V S G/U, i.e., that 
whenever 

G/W — G/V —> G/U (6) 
commutes, F(a)(z) = F(b)(z) € F(G/W). Since F is a sheaf, F' sends 

1 


G/aWa") ary 6: /W to a mono, so by precomposing with this map we 
may without loss assume that a = e in (6). Commutivity of (6) then 
means that b € U (and that W C V and W Cb-!Vb). But z Ev(F)", 
so 


b 
[F(G/b-1Vb — G/V)(z),b-1 V0] = [z, V] 
in ¥(F). This means that for a sufficiently small W’ C VNb7'V8, we 


e€ 


b 
have F(G/W' — G/V)(z) = F(G/W' — G/V)(z). Choosing W’ C W 
and using the fact that F(G/W’ =, G/W) is mono, we conclude that 
F(G/W -> G/V)(2) = F(G/W + G/V)(a). This shows that « is a 
matching family, as claimed. Therefore, there is a unique y € F(G/U) 


with F(G/V = G/U)(y) = x since F is a sheaf. But then [z,V] = 
ly,U] = au(y). Thus, [z,V] is in the image of ay, and the proof is 
complete. 


As a variant, we state 


Theorem 2. Let G be a topological group, and let U be a cofinal 
system of open subgroups (in the sense that any open subgroup contains 
a member of U). Then there is an equivalence of categories 


BG © Sh(Su(@)), 


where Sy(G) is the full subcategory of S(G) whose objects are of the 
form G/U with U €U, equipped with the atomic topology. 
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Proof: The proof of this result is completely analogous to that. of 
Theorem 1, as we leave for the reader to check. Theorem 2 can also be 
derived from Theorem 1 by using the “comparison lemma”, to be proved 
in the Appendix. 


As an example, let us consider the topological group G = Aut(N) 
of all automorphisms of the set N of natural numbers, with the product 
topology inherited from [];°, N = NN. Let U(K) = {a € Aut(N) | 
a(i) = 7 for i € K }, where K is a finite subset of N, and let Y be the 
collection of all U(K)’s. This Y is a cofinal system of open subgroups 
of Aut(N). We claim that there is a 1-1 correspondence between mor- 
phisms G/U(K) > G/U(ZL) in Sy(G) and monomorphisms L — K. For 
suppose a: G/U(K) — G/U(L) represents a morphism in 8,,(G), where 
a € Aut(N). Then U(K) C a7!U(L)a, or aU(K)a™! C U(L). This 
means that ada! fixes L for any automorphism ¢ which fixes K, and 
this is equivalent to the condition that a~1(L) C K. 

We may thus define a contravariant functor from S,,(G) to the cat- 
egory I of finite subsets K of N and monomorphisms between them, by 
sending G/U(K) to K and G/U(K) — G/U(L) toa: LK. This 
functor is well-defined, for 

U(L)a =U(L)B => a6! €U(L) 
<=> Vre L, af \(z)=2 
<= Vee L, B-\(x) =a7"(a), 
ie, a! and G7! define the same morphism L-> K. This also shows 
that this functor S,,(G) — I°? is faithful. It is also full since clearly any 


monomorphism [>— K can be extended to an isomorphism N — N. 
Therefore, we have an equivalence of categories 


Sy(Aut(N)) & 1°, 


and from Theorem 2 one obtains: 


Corollary 3. The category B Aut(N) is equivalent to the category 
of sheaves on the category I°P for the atomic topology, where I is the 
category of all injective functions between finite subsets of N. 


The topos B Aut(N) © Sh(I°P) of Corollary 3 is often called the 
Schanuel topos. For another description of it, see Exercise 13. 


Exercises 


1. Let X be a topological space. For a sieve S on an open subset 
U of X define S covers U iff U is the union of the sets in S. 
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Prove that this defines a Grothendieck topology on the partially 
ordered set, O(X) of all open subsets of X. 


. As alternative properties for a Grothendieck topology, consider: 


(iii*) (Weak Transitivity) If S € J(C) and T C S is any sieve 
on C’ such that T covers all arrows in S, then T € J(C). 
(iv) (Intersection) If S and T are sieves on C’, then SNT € 
J(C) iff Se J(C) and T € J(C). 
(v) (Inclusion) If S C T are sieves on C, then S € J(C) 
implies T € J(C). 


Show that the axioms for a Grothendieck topology may be taken 
to be (i), (ii) (of §2) together with (iii*) and (iv), or (i), (ii), (iii*) 
and (v). 


. (Bases for a Grothendieck topology on a category without pull- 


backs.) Let C be an arbitrary small category. Define a ba- 
sis for a Grothendieck topology on C to be a function K as 
in §2, Definition 2, except that (ii’) is replaced by: (ii”) If 
{ fi: C: 2 C |i € 1} © K(C), then for any morphism D ~ C, 
there exists a cover {h;: D; — D | j € I'} € K(D) such that 
for each j, go h; factors through some fj. 


(a) Check that (2) (of §2) still defines a Grothendieck topology 
on C, 

(b) State and prove the analogue of Proposition 4.1, for this 
definition of a basis on an arbitrary category C. 

(c) Show that Lemma 4.2 is a special case of the statement of 
part (b). 


. Let T be as in §2, Example (b), with the open cover topology 


given by the basis K as defined there. Define K' by { fi: ¥; > X | 
i€I}e K'(X) iff each f; is étale (i.e., a local homeomorphism; 
cf. §I1.6), and moreover X = [); fi(Yi). Show that K and Kk’ 
generate the same topology J on T. 
(a) Let T be as in §2, Example (b). Define A” by { fi: Yi > 
X|ierl}e K"(C) if X =U, f(%). Show that K” is 
a basis for a Grothendieck topology on T. Assume that 
1 € T and that all constant maps 1 —> T € T are in T. 
Prove that the category of K’’-sheaves on T is equivalent 
to the category of sets. 
(b) Let T be as in (a), and assume that the empty set does 
not belong to T. Prove that the topology generated by 
Kk" coincides with the dense topology on T. 
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6. Let C be a small category. 


(a) Check that if {Ja}q is a family of Grothendieck topologies 
on C, then ()Jq [defined by (()Ja)(C) = ()Ja(C)] is 
again one. 


It follows from (a) that, given any collection of sieves Sy on 
Cq (for a in some index set A), there is a smallest Grothendieck 
topology making these sieves into covers. As an example consider: 


(b) Let M be a category of manifolds, as in §2, Example (b). 
Prove that the open cover topology on M is the small- 
est one with the property that (i) the sieve generated by 
{(—1, 00) + R, (—0o, 1) > R} covers R; and (ii) the sieve 
generated by {(—n,n)— R|n > 0} covers R. 


7. (a) Show that the Zariski site (§3) is subcanonical. 
(b) Characterize the posets for which the dense topology is 
subcanonical [§2, Example (e)]. 


8. Let C be a small category, and let P: C°P — Sets be a presheaf 
on C. Recall from §5 of Chapter I that the category iP P of 
elements of P has as objects the pairs (x, C’) with z € P(C), and 
as morphisms (xz, C’) — (x’,C’) the morphisms f: C — C’ in C 
with the property that x’. f =<. 


(a) Prove that there is an equivalence of categories 


Sets” /P sets)” 


(b) Suppose that J is a topology on C, and that P: C°P = 
Sets is a J-sheaf. Describe a topology J’ on f., P such 
that the equivalence of (a) restricts to an equivalence 


Sh(C, J)/P & Sh( iL P, J’). 
Cc 


9. Let C be a small category, and let J be a Grothendieck topol- 
ogy on C. Let Sh(C, J )P*” be the category of functors D°? > 
Sh(C, J) and natural transformations between them, for some 
given small category D. Show that Sh(C, J)P°? is a Grothendieck 


158 


10. 


11. 


12. 


13. 
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topos, by exhibiting an equivalence of categories Sh(C x D, J’) = 
Sh(C, J)” for some suitable topology J’. 

Let X be a topological space, and let G be a (discrete) group 
acting on X by a continuous map G x X > X, (g,z)> g-az. An 
étale G-space over X is an étale map p: E — X (as in Chapter II, 
§6), where E is equipped with an action G x E — E by G such 
that p is compatible with the two actions on EF and on X. 


(a) Use the correspondence between étale spaces and sheaves 
of 811.6 to show that the category of étale G-spaces is a 
Grothendieck topos, by explicitly describing a site. 

(b) Prove that if the action of G on X is proper, then the 
category of étale G-spaces is equivalent to the category 
Sh(X/G) of sheaves on the orbit space X/G, where X/G 
is equipped with the quotient topology. (Recall that an 
action by G on X is called proper if for each point z € X 
there is a neighborhood U, of z with the property that for 
any g €G, if g-U, NU, # @ then g =e.) 


Let. J be a Grothendieck topology on a small category C. 


(a) Give an example to show that a: Sets“” — Sh(C, J) 
does not always preserve arbitrary limits. 

(b) Show that ay: C — Sh(C,J) preserves all limits which 
exist in C, if J is subcanonical. (What if J isn’t?) 

(c) Show that ay: C — Sh(C, J) preserves all exponentials 
which exist in C, if J is subcanonical. 

(d) Give examples which illustrate that ay: C — Sh(C, J) can 
be full, but not faithful, faithful but not full, or neither full 
nor faithful. 


Let J be a Grothendieck topology on a small category C. Define a 
presheaf O: C°P — Sets by setting O(C) = {0} if @ € J(C), and 
O(C) = @ otherwise. Prove that O is a sheaf on C, and that it is 
the initial object of Sh(C, J). Prove that for any E € Sh(C, J), 
the unique map O — E is mono. Show that O is isomorphic to 
the bottom element of the subobject lattice Sub(£) defined in 
(8.18). 

Let I be the category of finite sets and monomorphisms. Show 
that a functor P: I — Sets is a sheaf for the atomic topology on 
[°P iff 


(i) P sends every morphism of I to a monomorphism, and 
(ii) P preserves pullbacks. 
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14. 


15. 


16. 


17. 


Let X be a topological space. Recall that for a set S, A(S) € 
Sh(X) is the associated sheaf of the constant presheaf O(.X)°P > 
Sets with value S [cf. (4) of §6]. 


(a) Show that A(S) is the sheaf of continuous S-valued func- 
tions on X, where S is given the discrete topology. 

(b) Show that if X is locally connected, then A has a left 
adjoint 7: Sh(X) — Sets. [Hint: What is A(S) as an 
étale space over X?] 

(c) Show that if X is locally connected, then the functor 
A: Sets — Sh(X) commutes with exponentials (meaning 
that for any two sets S and T, the canonical morphism 
A(T*) — A(T)*©) of sheaves on X is an isomorphism). 

(d) If you are courageous, prove the converse of (c). 


[The Beck-Chevalley condition; see also §IV.9.] Let J be a 
Grothendieck topology on C. Let 


Y xx Z—4Z 


“| I 

Y ea xX 
be a pullback square in Sh(C, J). Using the explicit description 
of the functors occurring in Proposition 8.2 as given in the text, 


verify that 
wo Ay =3,, 07,1: Sub(Y) > Sub(Z). 


Deduce that ¢7! 0 Vy = Vx, 07): Sub(Z) — Sub(Y). 

Let ¢: X — Y be a morphism of sheaves on (C, J), as in the 
preceding exercise. In the following, you can either work out 
the explicit descriptions, or use abstract categorical arguments 
to deduce (b) from (a) and (a) from Exercise 15. 


(a) Show that for any two subsheaves A C X and BCY, 
3y(A A 6-\(B)) = 34(A) A B. 


(b) Show that ¢7': Sub(Y) — Sub(X) commutes with the 
implication operator => [cf. (7) of §8]. 


(a) [cf. §2, Example (d)] Let A be a complete lattice satisfying 
the identity (6) of §2 (the infinite distributive law). Check 


160 III. Grothendieck Topologies and Sheaves 


that A has the structure of a complete Heyting algebra [by 
(7) of §2]. 

(b) Show that if A is a complete Heyting algebra, then so is 
{bE€A|a<b<a’}, for any two fixed a, a’ € A. 

(c) Check that 2: A°P > Sets, O(a) = {bE A|b<a}isa 
sheaf on A (for the usual topology as in §2), and that it is 
the subobject classifier for Sh(A). 


18. Let T be a full small subcategory of the category of topological 
spaces, closed under taking open subspaces, and containing the 
one-point space. Consider the open cover topology. In §6 [below 
(15)], we defined a map p(Y*) — p(Y)°™) of sheaves on T, for 
any two topological spaces Y and X, with X locally compact. 
Prove that this map is an isomorphism if (i) X € T, or (ii) X has 
an open cover by spaces in T, or (iii) X is a metrizable space and 
T contains the subspace {0} U {1/n | n € N ~ {0}} of R. Give 
an example to show that p(Y*) — p(Y)¢(*) need not always be 
an isomorphism. 


IV 


First Properties of 
Elementary Topoi 


In this chapter we present elementary conditions (or axioms) that 
make a category € a topos, and then develop from these conditions and in 
a suitable order certain other basic properties. Most of these properties 
have already been seen to hold for our typical categories discussed in 
Chapter I, and for the categories of sheaves on a space (Chapter II) or 
on a site (Chapter IIT). 

The definition of elementary topos to be given here will be slightly 
different from (but of course equivalent to) the one mentioned in §I.6. 
Specifically, besides the existence of finite limits, the axioms require the 
existence of a subobject classifier 9 and, for each object B, a power ob- 
ject PB—identical to the exponential 27. From these it can be proved 
that arbitrary exponentials A® exist (§2), as well as finite colimits (§4). 

One of the fundamental persistence properties is that if € is a topos 
and B is an object of €, then the slice category €/B is again a topos just 
as in the familiar case when € = Sets [§I.1, Example (ix)]. Moreover, 
for a morphism B’ — B of €, the change-of-base functor E/B — €/B' 
preserves all topos structure (§7). 

In §8 it will be shown that if B is an object of a topos €, the partially 
ordered set. Subg B has the structure of a Heyting algebra, natural in 
B. It then follows by the Yoneda principle that the subobject classifier 
©, as well as each power object PB, have the structure of an internal 
Heyting algebra. 


1. Definition of a Topos 


The typical categories examined in Chapters I and II all satisfy the 
following description of an elementary topos (for short, we say just topos, 
plural topoi). 


Definition. A topos € is a category with all finite limits, equipped 
with an object 0, with a function P which assigns to each object B of € 
an object PB of €, and, for each object A of €, with two isomorphisms, 
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each natural in A 


Sube A © Home (4,2), (1) 
Home(B x A,Q) = Home(A, PB). (2) 


In other words, the functors Subg and Hom,g(B x — ,Q), the latter 
for each object B of €, are required to be representable. In the first case, 
the representing object 2 is the subobject classifier, as already described 
in $1.4, while in the second case PB may be called the power object of 
B; moreover, the function P can be extended to a functor P: E€°P - € 
in exactly one way so that (2) becomes natural in B (as well as in A), 
see (7) below. Equivalently, the natural isomorphism (2) states that 
PB is the exponential 07, as described in §1.5; therefore, any category 
which satisfies (1) and has all exponentials will, in particular, have all 
power objects. Hence, all the typical categories discussed in Chapter I 
are topoi. Also, the category Sh(X) of sheaves on a topological space X 
is a topos, as is the category Sh(C, J) of sheaves ona site. We emphasize 
that at first sight the axioms for a topos just given seem weaker than 
those of 81.6. However, it will be shown in the course of this chapter 
that the two sets of axioms are actually equivalent. 

Notice that the two natural isomorphisms (1) and (2) may be com- 
bined in a single isomorphism 


Sube(B x A) = Hom,(A, PB), (3) 


natural in A. With B = 1, this implies (1) with 9 = P1, and hence 
also (2). In other words, we could define a topos as a category with 
finite limits equipped with a single operation P satisfying (3). 

In this definition of a topos, we have referred to “sets” of the form 
Sube(A) or Home¢(A, B). In effect we have assumed that € is a locally 
small category, so that these objects are indeed small sets. However, 
topos theory may serve as a foundation of mathematics, alternative to 
set theory. (We mentioned this in the introduction, and will go into 
it in more detail in Chapter VI, below.) For this reason it should be 
emphasized that the axioms for a topos (and in fact any other use of 
“Sub” or “Hom” in this chapter) can be reformulated in an elementary 
way. This provides a first-order theory of topoi, using no set. theory. 

The method of this reformulation is the usual one, that of replacing 
an adjunction by the universal property of its counit. In this case, this 
amounts to setting A = 2 in (1), A = PB in (2), and taking the image 
under the stated isomorphisms of the identity map in the hom-set on 
the right. This leads to the following version of the definition of a topos 
in a form free of any reference to hom-sets or other sets. 
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Definition (Elementary Form). A topos is a category € with 


(i) A pullback for every diagram X + B+ Y; 
(ii) A terminal object 1; 
(iii) An object Q and a monic arrow true: 1+ such that for any 
monicm: S > B there is a unique arrow ¢: B > Q in € for which 
the following square is a pullback: 


"| Jive (4) 


In this case we write ¢ = char S or ¢ = charm, and call @ the 
characteristic map of m (sometimes called the classifying map of 

(iv) To each object B an object PB and an arrow €g: Bx PB —Q 
such that for every arrow f: B x A— 91) there is a unique arrow 
g: A— PB for which the following diagram commutes: 


A Bx A—+—9 
a| xa! | (5) 
PB Bx PB—~—=9 

B 


Given (iv), the natural isomorphism (2) is the correspondence f +> g 
with inverse given explicitly as 


gf = €p(1 x 9); (6) 
we call g the P-transpose of f and f = g the P-transpose of g. Note 
in particular that the “counit” eg is the P-transpose of the identity 
1: PB — PB. Moreover P, construed as a functor €°P — E, sends each 
arrow h: B > C'to that arrow Ph: PC — PB which makes (2) natural 
in B; that is, to the unique arrow Ph with 


€p(1 x Ph) = Ec(h x 1): Bx PC HQ. (7) 


Thus Ph is by definition the arrow which makes the following diagram 
commute: 


Cx PC 
a, 
BxPC 2 


Bx PB. 
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This is often expressed by saying that the map € , is dinatural in B 
[CWM, p. 214]. By fitting two such diagrams together, it also follows 
that P(hk) = P(k)P(h) whenever the composite h ok is defined. Thus, 
P: € — € isa contravariant functor. 

For an arrow h: B — C, the map Ph: PC — PB is really the 
pullback operation in disguise. By this, we mean the following: consider 
amap g: A — PC and its composition Phog: A — PB with Ph. Ifthe 
P-transpose g: C x A — 2 of g is the characteristic map of a subobject 
UC x A (so that the right-hand square below is a pullback), then the 
P-transpose (Pho g): Bx A— 0 of Phog is the characteristic map of 
the pullback (h x 1)71(U) > B x A of U along h x 1: 


(os rs | 


| p.b. p.b. Jn (8) 


BeAS— FC KA, 


9 
Indeed, by definition [see (6)], 9 = €c (le x g), and (P(h) og)* = 
€p (1p x (Phog)). But 
€p (1p x (Phog)) = €g o(1g x Ph) o (1p x g) 
= €c o(h X lpg) o (1z x g) [by (7)] 
= €¢ O(1g x g)o(h x 1a) 
=G0(hx 1a), 
and this last map is indeed the characteristic map for the subobject 
(h x 1)71(U) of B x A, as in the pullback diagram (8). 

An arrow b: X — B may be considered as a sort of “generalized 
element” of B—more specifically, a (generalized) element defined “over” 
X. The elements defined over the terminal object 1 are called the global 
elements of B. If € = Sets, they correspond exactly to the actual 
elements of the set B. [The phrase “global element” comes from sheaf 
theory: the global elements of a sheaf F' on a space X are precisely 
the global sections, i.e., those defined on the entire space X, of the 
corresponding étale space AF’ — X. This is because the terminal object 

1 
1 of the topos Sh(X) is the sheaf corresponding to the étale space X — 
Xx’ 

For example, by the universal property of the projections of a prod- 
uct A x B, two generalized elements defined over X determine a gener- 
alized element (a, 5) of the product, as in the diagram 


x 
a. 


A«——- Ax B—__- B. 
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In Sets, this element is the ordered pair (a, 5). 

On the other hand, an arrow 9: B — 2: may be considered to be a 
predicate for B, or a property of generalized elements of B. For instance, 
the predicate “true of B” is 

trueg: B35 1 “@& 0, (9) 
where ! =! is the unique arrow from B to the terminal object 1. In 
this language, the statement that (4) is a pullback diagram reads: An 
“element” b: X — B factors through (is in) the subobject S»— B if 
and only if (char S)b = truex; intuitively, char S is that predicate for B 
which is true for exactly those generalized elements of B which lie in S. 
Also, the uniqueness of ¢ in (4) states that two predicates of B are equal 
if and only if they are true of the same (generalized) elements. (This is 
an extensionality principle.) 

For each object A & A x 1, the isomorphisms (1) and (2) yield 


Sub, A & Home(A,2) = Home(1, PA). (10) 


Thus, without reference to an underlying category of sets, a subobject 
of A has the corresponding three descriptions, 


m: SA, ¢:A>Q, s:1— PA, (11) 


as an equivalence class of monics to A, as a predicate of A, and as a 
global element of the power object PA. When m, ¢, and s correspond 
by (11), we write 


S={a| $}, o = char S, s="¢', (12) 
and call S the extension of the predicate ¢, ¢ the characteristic function 
of S and s the name of ¢ (or of S). If s = "@' and b: X — B is any 
element, then by (5) the following diagram commutes 

xX be ae eG 
i [ee 


A 1 Bx at BP, 


u 


where the first two vertical maps are the canonical isomorphisms. Thus, 
Ep(b x s) = truex,1 if and only if ob = truex, s= repr, 


(and hence, if and only if b is an element of the subobject named by 
8). In this way, €p is the membership predicate for B, in the sense that 
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€p(b x s) is true exactly when b is a member of (the subobject_ named 
by) s. 

For each object B, the diagonal map Ag: B > Bx B is that arrow 
whose composite with each of the two projections B x B — B is the 
identity of B. This Ag is monic, so we may form the diagram 


Bx B -------9 (13) 


where 6g = char Ag is the unique map which makes the square a pull- 
back, while the map {-}z in the bottom line is its P-transpose, via (2). 
For generalized elements 6, b': X — B and the corresponding arrow 
(b,0'): X — Bx B one thus has 6,(b, ’) = truey if and only if b = b’. 
Thus, 6g (the “Kronecker delta” ) is the predicate of equality for B. Also, 
by the description (5) applied to the transpose {-}g of 6p, 
€p(b, {-}5d') = €a(1 x {-}5)(,0") = 55 (0, 0"); 
therefore €p(, {-}"b’) = truex if and only if b = b’, and so {-}gb’ is 
that subobject of B whose only X-based element is b’. Thus, if the topos 
E is Sets, {-}3: B — PB sends each b’ € B to the usual singleton set 
{b'}; accordingly, we call {-},% the stngleton arrow for B. 
These observations motivate the following 


Lemma 1. For all objects B in a topos, {-}g is monic. 


Proof: Suppose that {-}5b = {-}gb’. Then by the definition of 
{:}B, 6p(1 x b) = 6g(1 x 0’). Now in the diagram 


X u +B >] 


(b,1 | as] fe 


Bx Xx —**.,Bx B—2—0 
both squares are pullbacks, the first. by inspection and the second by the 
definition of 6g. Therefore, the rectangle is a pullback, so (6,1) and (by 
the same diagram for b’) (b’,1) are pullbacks of the same map. Thus, 
(6,1) and (6’,1) represent the same subobject of B x X, so there is 
an isomorphism h: X > X with (6,1) = (0’,1)h, so that b = b’A and 
1 =h, whence 6 = b’, which proves that {-}g is monic. 


Since {-}g is monic, it has a characteristic function 
op =char{-}—: PB 2. (14) 
Intuitively, this og is the predicate “is a singleton”. 
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Proposition 2. In a topos, every monic arrow is an equalizer and 
every arrow both monic and epi is an isomorphism. 


Proof: The definition (4) of a subobject classifier states that any 
monic m: S > B is the equalizer of trueg [see (9)] and charm. Now if 
a monic e, as the equalizer e of any two parallel arrows f and g, is also 
epi, fe = ge implies f = g; but the equalizer of f and f can only be an 
isomorphism, so e is such, as asserted. 

Any category in which (as in a topos) monic plus epi implies iso is 
said to be balanced—but recall that many familiar categories, such as 
the category of rings, or of topological spaces, are not balanced. 


2. The Construction of Exponentials 
We next prove 


Theorem 1. Every topos has exponentials. 


In other words, the axioms of a topos, including the existence of the 
particular exponentials PB = 0, are enough to construct all exponen- 
tials C?, so that every topos is cartesian closed. This means that we 
could have defined a topos as a cartesian closed category with equalizers 
and a subobject classifier. 

The proof of this theorem is achieved by translating into topos 
language one of the descriptions of the exponential C? good in the 
particular topos € = Sets. There, C¥ is just the set of all func- 
tions f: B — C; moreover, a function f can be given by its graph 
Gs ={(fb,b)|b¢B}CCx B. Conversely, we can test whether a 
subset, S Cc C x B is the graph of some function f. If so, the value 
f(b) for each element 6 can be described by saying that the subset 
(C x {b}) NS Cc C x B, projected onto C, is the singleton set {f(b)}. So 
call this projection v(b, S) and write 


v(b,S) ={c| (c,b) eS} e PC, 
u(S) = {6 | v(b, S) is a singleton } ¢ PB; 


then S is a graph iff every v(b,S) is a singleton; that is, iff u(S) is 
B, regarded as an element of PB. Now B as a subobject B Cc B has 
characteristic function the predicate trueg: B — 1 — 1, and hence has 
the name 

rtruep? 

ee B, 
which is the P-transpose of Bx 1— B—1-— 9. This name is B, here 
regarded as an element of PB. 

This whole description of a graph can be written in any topos in 

the following diagrams, where for simplicity of notation the product 
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C x Bx P is taken to be associative, i.e., the associativity isomorphism 
C x (Bx P) =(C x B) x P is replaced by the identity: 


C x Bx P(C x B) 449, 
Bx P(C x B) —*— PC 2, 


P(C x B) ——_*—__ PB 
m | f true Bo 


oF ——______ 1. 


Here v is defined as the P-transpose of the arrow €c xg of the first 
line, where og is the predicate “is a singleton” as in (1.14), while wu is 
defined as the P-transpose of the composite arrow ocv of the second 
line. Finally, the object C? and the arrow m are defined by taking the 
bottom square to be a pullback, of "true B” along u. Since "true B™ 
has domain 1, it must be monic, and hence so is its pullback m. Thus, 
the intended exponential C? is indeed a subobject of the power object 
P(C x B). 

For this exponential we also need a corresponding evaluation map 
e: Bx C® — C, sometimes written as ev: C? x B — C. In Sets, the 
value f(b) of a function f at the argument b can be described in terms 
of the graph Gy of f as that element c whose singleton is v(b, Gy). The 
same idea gives e in any topos from the diagram 


Bx CB 1X”, By PIC x B)—2 > Pc th 


a 


Bx) aes 2 Mt PO Se ee 


1xr true Bo €E true 


where the left-hand square is Bx(definition of C®), the middle square 
is the definition of v from u via the inverse-transpose formula (1.6), the 
right-hand square is the definition (1.14) of o¢ from “singleton”, and 
the bottom distorted square is the definition of “trues” from trueg. 
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Because the right-hand square is a pullback and the diagram commutes, 
there must exist a unique map [shown dotted at the top in (2)| 


e:BxC? .C. 


To complete the proof that C? is the exponential, we must show that 
the map e is universal from B x — ; that is, that toeach f: Bx A3C 
there is a unique g: A > C® with f = e(1 x g). Indeed, if there is such 
a map g, then the definition of e in (2) above gives 


{}ef = {-}ce(1 x g) = (1 x mg): Bx A PC. (3) 


Now {-}o is the P-transpose of 6g and v is the transpose of €, so the 
P-transpose of this equation reads 


6c(1 x f) = €exa(1x1xmg):Cx Bx A. (4) 


This equation shows that mg = h is uniquely determined by f; since m 
is monic, this means that g is unique if it exists. 

Now reverse this argument, starting with f: Bx A— C. The arrow 
6c(1 x f) has a P-transpose h: A — P(C x B) which means, by the 
formula (1.5) for the transpose, that it can be written 


6c(1x f) = €exa(1x1xh):Cx Bx AQ, 


much as in (4). Transposing both sides to B x A — PC and using the 
definitions of {-}¢ and v as transposes gives 


{Jef =v(1 xh): Bx A> PC, 
as in (3). Composition with o¢ yields 
trueg of =ocv(1 xh): Bx A> Q. 

On the left, trueg of = truegx 4 = trueg op for p: Bx A > B, while on 
the right the transpose of ogv by definition is u. Hence, transposing both 
sides and writing ! 4 for the unique arrow A — 1, we obtain "trueg ol, = 
uh. This, by (1), states that h must factor through the pullback C? to 
give by (3) the desired arrow g, as in the diagram 

CF a 


v7 
g “ 
of Hy rtruep? 
a 
a 


A> P(C x B) ——> PB. 
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This completes the proof of Theorem 1. 

For objects A and B of a topos €, the exponential B4 is often called 
the “internal Hom-set” [and is sometimes denoted by Hom,(A, B)]. It 
has an operation of “internal composition” 


m: CB x BA CA, (5) 
which can be defined as the transpose of successive evaluations: 
C? yy BAKxA— 367 x B20. 


The connection of internal composition m with “external” composition 
is as follows. Given arrows f: A — B and g: B — C, one can (by 
the isomorphisms A = 1 x A and B = 1 x B) transpose these ar- 
rows to get global elements_ of the exponential objects f: 1— B4 and 
g:1—C®. Then mo (Gg, f): 1 > C4 is precisely the transpose of the 
ordinary composition go f: A — C. Similarly, generalized elements can 
be composed using the map m of (5): Given such generalized elements 
f: X — BA and g: X — C®, one obtains their composition as a gener- 
alized element of C4, simply by composing (g, f ): X > C? x B4 with 
m: CB x BA CA, The transpose of this map mo(g, f): X > C4 is 
the (external) composite 

XxA Bia XxB ce eet C, 
where f’ and g’ are the transposed maps of f and g. 

If € and €’ are topoi, a logical morphism T: E —> E' is a functor 
which preserves, up to isomorphisms, all the structures required to define 
a topos; specifically, it preserves all finite limits, the subobject classifier 
and the exponential, each up to isomorphism. 

Recall that T preserves all finite limits when, for every limiting cone 
kK: C > F for a functor F: J > E from a finite category J, the compos- 
ite cone TK: TC — TF is a limiting cone for TF’. Since all finite limits 
can be constructed from pullbacks and a terminal object, it is enough to 
require that T carry each pullback diagram in € into a pullback diagram 
in €’ and that T carry the terminal object in € into a terminal object 
in €'. 

Specifically, if x = XxY Ee Y are the projections of a given 
product diagram in €, preservation of the product means that 


TX "7 (X XV) — TY 
is a product diagram in €’; in case a product diagram for TX x TY is 


already at hand in €’, this means that there is a unique (or “canonical” ) 
isomorphism T(X x Y) = TX x TY preserving the projections, 
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Similarly, to say that T preserves the subobject classifier true: 14 0 
of € means that T(true): T1 > TO is a subobject classifier for €’. Also, 
to say that T preserves exponentials means that for any two objects B, 
C of € with exponential C? and evaluation e, the object T(C”) is an 
exponential in €’, with evaluation map 


T(B) x T(C®) = T(B x C®) "£4 TC, 


where the first isomorphism is that arising because T already is known 
to preserve products. In view of the construction just completed for 
exponentials in terms of power-sets, it is sufficient to assume instead 
that T preserves power-sets: For each object B of € with power-set PB 
and “membership” relation €g: B x PB — (as in (iv) of §1, the image 
TPB is a power-set for TB, with corresponding membership relation 


TB x TPB T(Bx PB) ~£25T9. 


3. Direct Image 


The direct image of a subset S’ C B’ under a map k: B’ — B of sets 
is described with an existential quantifier as {6 | 3b’ € S’, k(b’) = b}. 
A corresponding construction of the direct image under an arrow k ina 
topos will be given in 86. For the present we treat only a monic k. For 
each monic k: B’ — B in a topos €, we now describe an arrow 


3.: PB’ — PB (1) 
which will correspond to the intuitive idea “direct image (of a subobject 


of B’) under k”. We will first do this in purely elementary form. The 
construction is contained in the diagram 


gS 


B' x PB’ ——->9 true 

€pr 
ba (2) 
Bx PB! --------,-------- >Q 
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Starting from €,-, take the upper (small) square to be a pullback; this 
gives the object U and the monic ug (to be considered as the member- 
ship relation on B’). Then construct ex = char((k x 1)ug’) so that 
the large square is a pullback, and finally define 4, at the bottom 
to be the P-transpose of this map e;. In the topos Sets, U is then 
{(0',S") | & € S’ Cc B’}, so ex(b, S’) is true when b = kb’ for some 
b! € S’ and then 3; S” is the set of all such elements 5; that is, the direct 
image of S’ under k. The same fact may be stated in any topos by 
describing the action of 3, on the names of characteristic functions in 
the following way 


k 
Proposition 1. For monics SB B ina topos, 
4," charm? = “charkm™: 1 PB. (3) 


In other words, 4, carries the name of any subobject m of B’ to the 
name of its image km. 
It will suffice to prove the transposed equality 


ex(1 x “char m7) = char(km): B x 1——> 2. (4) 


(we identify B with Bx 1 by the canonical isomorphism). To do this we 
show that both of these predicates characterize the same subobject of 
B. So we will construct the pullback of true along the left-hand arrow 
of (4) by adding a left-hand side to the diagram (2), as in 


S x1 ------ aaa » U ————— 1 = 1 
dices up 
Ge 
X x 1——> B’ x 1_—_——_> B’ x PB’ —— 9 true (5) 
uxl lx charm Ep 


bn ps 


- 
BX 1 agar? BX PB’ ——, > 0. 


The large rectangle on the right is a pullback, by the definition of e, in 
(2), while the small square bottom left is a pullback for formal reasons. 
Through the middle the composite is 


Ep (1 x “charm”) = charm: B’ = B’x 1-0 (6) 


by the definition of "char m” as the transpose of charm. But (charm) © 
m = trues; since U is a pullback, this means that we can insert the 
top left-hand square with S x 1 and an arrow w to make the top left 
square commute. We now claim that the top rectangle, composed of 
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two squares, is a pullback. This means that any arrow u: X — B’ 
with €g/(1 x “char m”)(u x 1) = truex must factor uniquely through m. 
Using (6) again, u satisfies (char m)u = truex. In turn, this means that 
u factors through m as u = mu’, and as displayed top left. Since the 
rectangle is now shown to be a pullback, so is the square top left. The 
big square must now be a pullback, and this means that the subobject 
of B characterized by the left-hand map in (4) is Sx 1 B’x1— Bx], 
but this is the same as the subobject S — B’ — B characterized by the 
right-hand side of (4), as was to be shown. 

This direct image 3; for power objects PB’ has an analog k for 
Sub B’. The pullback of a composite is trivially a composite of pullbacks, 
as for the pullback squares in the diagram 


C A B" 
Cc’ Sar B' (7) 


C—;—B 


in any topos €. Now suppose in this diagram that both k and k’ are 
monic. The composite k ok’, regarded as a subobject of B, is then the 
usual “direct image” of the subobject k’ under k. Let 


ky: Subg Bi Sube B 


denote this direct image operation. The diagram (7) above also exhibits 
the pullback operation Sube(g): Sube(B) — Subg(C) and yields the 
equation 


(Sube(g)) kik’ = (Sube(g))(k ok’) = mm! = mm! = m(Sub g')k’; 


in words, the pullback along g of the direct image under k is the direct 
image under m of the pullback along g’. This asserts that if k is monic in 
the pullback square gm = kg’ of (7) above, then the following diagram 
is commutative 


Sub B’ 882", sub cr 
je 


Sub B Subs Sub Cc 
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in the category of sets. This conclusion is the “external” Beck-Chevalley 
condition for sets of subobjects. 

Now the corresponding conclusion will hold internally—that is, for 
the internal power-set objects PB as opposed to the “external” ones 
Sub;(B). This result, the “internal” Beck-Chevalley condition, will be 
useful in the construction in §4 below of colimits in €. 


Proposition 2 (The Beck-Chevalley Condition for 3). If m is the 
pullback of a monic k along an arbitrary arrow g in a topos €, as in the 
square left below, then the right-hand square below will commute 


Cc’ 2, B’ PB —?2_, pc! 
C——B, PB PC. 
Pg 


Proof: The desired equality Pg oA, = dm oPg! will follow from the 
equality e,(g x 1) = €m(1x Pg’) of their transposes, or from the equality 
of the subobjects characterized by those transposes, as in the diagrams 


[eee ee eee 205 1 


C x PB’ —_—— B x PB' —,— 727, 
gx1 ke (10) 


? ------------+ > Ug ————> 1 


1 
{ 
| ies | 
t 
¥ 


C’ x PB’ EEPR GH x PC’ 


mt ma 


C x PB’ —_—>C x PC! —— > 0. 
1xPg m 


In the upper diagram, the left-hand square is obtained by applying — x 
PB’ to the pullback square of (9); in the lower diagram, the left-hand 
square is formally a pullback. Both vertical rectangles are pullbacks. 
It remains to fill in the top left small pullback squares. But €, is 
“dinatural” ((CWM, p. 214], and (1.7)) in its argument B, meaning 
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simply that the diamond 


"x PB' 


B 
g’x1 
C’ x PB’ 
oN 
C 
always commutes. This in turn means that true: 1 > 2 pulled back 
along either the top or the bottom composite of the diamond gives the 
same result. Pulled back first along €g or Eg: it gives Ug or Uc, as in 


(10) above; hence the two missing top left vertices of the two diagrams 
of (10) above will be the same. This proves the proposition. 


Ne 
QD 
"x PC' 


Corollary 3. Ifk: B' — B is a monomorphism, then the composite 


PB’ —*_, pp—P4, pp’ (11) 
is the identity. 
Proof: For any monomorphism k, the square 


B' 1 B 


B' as ok B 
is a pullback; moreover, 4; = 1 and P1 = 1, hence the Beck-Chevalley 
condition (9) gives the result. 


We would like to emphasize that, in this section, we have constructed 
the map 3,: PB’ — PB for a monomorphism k: B’ — B by purely 
elementary means; this is an illustration of the fact that it is perfectly 
Possible to develop the elementary theory of a topos in a way that does 
not depend on an ambient set theory. 

If, on the contrary, one implicitly assumes that the topos € is lo- 
cally small (or one works with some appropriate comprehension princi- 
ple), the map (1) can also be constructed using the functors Sub and 
Hom, as follows. Suppose we are given a monomorphism k: B’ > B. 
Then for any object X of €, k induces by composition, as in (7), 
an operation (k x 1):: Sube(B’ x X) > Subg(B x X). Hence, by 
the natural isomorphisms (1) and (2) of §1, we obtain an operation 
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Im,: Home(X, PB’) — Home(X, PB), again natural in X, such that 
the diagram 


Home(X, PB’) —=— Sub¢(B’ x X) 


in| |e (12) 
Hom, (X, PB) —~—> Subg(B x X) 


commutes. By the Yoneda lemma, the latter operation must be induced 
via composition by a uniquely determined map PB’ — PB, which we 
call 3,. To recover an explicit description of this map PB’ — PB, 
we apply Im, for X = PB’ to the identity. But lpg: PB’ — PB’ 
corresponds to the subobject of B’ x X = B’ x PB' obtained by pullback 
along €: B’ x PB’ > Q, which is precisely ugr: U — B’ x PB’ of (2) 
above. Chasing further around (12) will then show that this definition 
of 3: PB’ — PB agrees with the earlier one. 

This illustrates the way in which set-theoretic arguments, using the 
sets Sub and Hom, can by Yoneda be translated back into elementary 
language. Later on, in (9.9), we will construct a direct image map 
PB' — PB for any morphism k: B’ — B (not necessarily monic), and 
prove a Beck-Chevalley condition in that generality. 


4. Monads and Beck’s Theorem 


By definition, a topos has all finite limits. In the next section, we 
will prove that it also has finite colimits. The argument requires some 
background concerning monads, which we will review in this section. A 
more detailed presentation can be found in [CWM, pp. 133-151]. The 
reader willing to assume that all his topoi (like those in Chapter I) have 
finite colimits may skip this section and the next—but only at the cost 
of missing an elegant pair of theorems. 

A monad (or triple) in a category C consists of an endofunctor 
T: C > C and two natural transformations 4: T? > T and 7: I > T, 
where J is the identity functor, such that the following diagrams of func- 
tors and natural transformations commute 


ps gt pp pe ry 
nf rf SoZ (1) 
T*? —,— T, T. 


These are exactly analogous to the diagrammatic definition of a monoid 
T in the category of sets. The first diagram expresses the “associative” 
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law for the “multiplication” yw, and the second the “identity” law for the 
“anit” 7 of the monad. 
An arbitrary pair of adjoint functors 


Cc = A F4G (2) 
with unit 7: Ic — GF and counit «: FG — Ia determines a monad 
(T,n, w) in C with T = GF, n = n, and 

Lo = Gero: GFGFC >= GFC =TC (CEC). 


Next we introduce algebras for a monad T in a way which directly 
generalizes the (left) actions of a monoid. Thus, if the monoid M (in 
Sets) has multiplication v: M x M — M and unit e: 1 > M, a left 
action of M onaset Y isa map h: Mx Y — Y such that the diagrams 


MxMxyY 2*+.Mxy yYorxy—1,MxyY 
vs] [ and | 
M x Y ———_+, Y 


both commute. In particular, any set X determines the set FX = Mx X 
with the action h =vx1:MxMx X — M x X, called the “free” 
action. If BM is the category of all left actions by M on sets, then this 
functor F': Sets — BM has an evident adjoint (the forgetful functor) 
and so defines a monad (T, 7, 4) on Sets, with TX = M x X. 

More generally, given any monad (T, 7, js) on @ category C, we imi- 
tate the above definition of an action by constructing the category C7 of 
T-algebras. Its objects are pairs (C,h: TC’ — C), where C is an object 
of C and h is a morphism such that 


Tc —.TC COS227C 
rel I and Sc [ (3) 
TC——>C C 


commute; a morphism f: (C,h) — (C',h’) of T-algebras is a map 
f: C=C’ in C such that h’ oT f = f oh. There is an obvious forgetful 
functor 

GP:CT5C, (GA), 


which has a left adjoint F?: C — CT sending an object C’ to the cor- 
responding “free algebra” F™C = (TC, yc: T?C = TC). With the 
evident unit 77 and counit e", this adjunction determines a monad on 
C, which is precisely the monad (T, 7, 4) we started out with. 
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Given a functor G: A > C with a left adjoint F: C — A, we may 
thus construct a monad (7,7, 4) on C and a diagram of categories and 
functors 


A—*_.c? 


i ele ° 


where the so-called comparison functor K is defined by 
KA= (GA, Ge,: GFGA > GA), ACA. (5) 


Thus, Ko F = FT and G’ oK =G. 

A functor G: A — C is said to be monadic if G has a left adjoint 
F and this comparison functor K is an equivalence of categories. (The 
definition in [(CWM, p. 139], is more restrictive, requiring that K be an 
isomorphism of categories.) 


Proposition 1. A monadic functor creates all limits. 


Proof: Let G: A — C be monadic. Then by definition, G is the 
forgetful functor G? : CT — C, up to an equivalence of categories A = 
C’. It thus suffices to show that such a forgetful functor G7: CT — C 
creates limits (cf. Exercise 2 of [CWM, p. 138]). By the definition 
of “creates” ((CWM, p. 108]) this means that we have to prove the 
following: given a functor H: J — CT? and a limiting cone r: C > 
G? oH in C for its composite with G’, there exists a unique object A 
in C? with a cone 0: A — H which is mapped by G® to the original 
cone 7; moreover, this cone o is a limiting cone. Now 7 consists of 
suitable arrows 7;: C > G7 H; (for each object j € J), while each H; is 
a T-algebra, say H, = (C,,h,: TC, — Cj). Thus, G7 H; = C,. Because 
7 is a limiting cone in C, there is a unique arrow h: TC — C such that 
the diagrams 


for 7 € J all commute. One verifies readily that (C,h) is indeed a 
T-algebra, while this diagram states that each 7, is a morphism of T- 
algebras. Hence the 7;: (C,h) > (Cj, h,) = H, do form a cone oa in the 
category C™, the unique cone with G'o = Tr. It is easy to verify that 
this cone ¢ is indeed a limiting cone in C’, as required. 
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This proposition includes the familiar fact [CWM, p. 108] that the 
forgetful functor from groups to Sets, and similar forgetful functors from 
categories of algebras, do create all limits. It also shows that a monadic 
functor G: A — C to a complete category C has a complete domain A. 

The following version of Beck’s theorem gives conditions on a functor 
G which ensure that G is monadic. To state the theorem, we need the 
notion of a reflerive pair: it is a pair of arrows s, t: A B in a category 
A such that there exists an arrow i: B — A with si = lg = ti. For 
example, in Sets each binary relation R C Sx S ona set S determines a 
pair of arrows R= S, and the relation is reflexive, in the standard sense 
for sets, if and only if this is a reflexive pair in the sense above. 


Theorem 2. Let G: A — C bea functor with a left adjoint, T the 
corresponding monad in C, and K: A — CT? the resulting comparison 
functor, all as in (4). 


(i) If A has coequalizers of all reflexive pairs, K has a left adjoint L. 
(ii) If, in addition, G preserves these coequalizers, the unit of this 
adjunction is an isomorphism Igr = K oL, 
(iii) If, in addition to (i) and (ii), G reflects isomorphisms, then the 
counit of this adjunction is also an isomorphism Lo K & Ta. 
Consequently, G is monadic in this case. 


Recall that G is said to reflect isomorphisms if, for each arrow t of 
A, t is an isomorphism whenever Gt is. 

Proof: (This is Exercise 3 of [CWM, p. 151] and we onl, give an 
outline.) 

(i) The left adjoint L is constructed as follows: given a T-algebra 
(C,h: GFC — C) where F is the left adjoint of G and T = GF, take 
L(C, h) to be the coequalizer 


FGFC = FC —2 L(C,h) (6) 


in A, where ¢ is the counit of F' 4 G; this coequalizer exists because the 
pair (Fh, €rc) is reflexive, as witnessed by Fnc: FC —~ FGFC. One 
then proves L left adjoint to K. 

(ii) If (C, A) is an algebra, the top row of the diagram below is a split 
coequalizer ([CWM], (4) on p. 148), and the unit \ of the adjunction 
L + K is the unique dotted arrow filling in (7): 


GFGFC a GFc —_*+—.¢ 
eFC : 
| | iA (7) 


GFGFC == GFC —z—> GLC, h). 
€FC 
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Indeed, GL(C, h) is the underlying object of the algebra KL(C, h), and 
one can check that \ defines an algebra map (C,h) —~ KL(C,h). But 
if G preserves the coequalizer, both rows of (7) are coequalizers, so is 
an isomorphism. 

(iii) For an object A of A, LK A fits into a coequalizer (8), 


FGFGA ——*4 FGA——» LKA 
ee (8) 
A 
and the counit of the adjunction L 4 K is the unique factorization 
ka: LKA-— Aas indicated in (8). But 


GFGFGA = GFGA —G4.4GA 
€FGA 


is a split coequalizer, so if G preserves the coequalizer (8) defining LK A, 
then Gx, is an isomorphism, and hence so is Kk, if we assume that G 
reflects isomorphisms. 


To summarize, we have: 


Corollary 3. If the category A has coequalizers of all reflexive 
pairs, while the functor G: A — C has a left adjoint, reflects isomor- 
phisms and preserves coequalizers of reflexive pairs, then G is monadic. 


In this proof and the corollary, “reflexive pair’ may be replaced 
throughout by “parallel pair” f, g: A> B in A such that Gf, Gg fit 
into a split coequalizer in C 


5. The Construction of Colimits 


We can now show that every topos € has all finite colimits—in par- 
ticular, has an initial object 0, sums (= coproducts), coequalizers, and 
pushouts. The proof uses both finite limits and the fact that the power- 
set functor P is its “own” left adjoint, and so defines a monad in €. 


Theorem 1. The functor P: E°? — E has a left adjoint; namely, 
PP: E — EP, 


Proof: Recall first that each category C determines its “opposite” 
category C°P (same objects and arrows reversed), while each functor 
T: C — D determines an opposite functor T°?: C°? > D°? (with the 
same object and arrow functions). Thus in particular the power-set 


5. The Construction of Colimits 181 


functor P: €°P — € yields also the functor P°P, which is the “same” 
functor, but considered as acting on €, not on €°P. The asserted ad- 
junction follows because the product is commutative according to the 
familiar canonical isomorphism +: A x B = B x A; this yields the fol- 
lowing sequence of natural isomorphisms of hom-sets: 


E(A, PB) © E(B x A,Q) © E(B, PA) = E€°?(PA, B). (1) 


The result expressed in (1) is often formulated as “P is adjoint to 
itself on the right”. Generally, functors $: C°? — D and T: D9? — C 
are said to be adjoint on the right when there is a natural isomorphism 

D(B,SA) = C(A,TB), AéeC, BeD. 

Theorem 2. In the adjunction of P°? to P, the unit n: I + PP9°P 

is for each object A of € that arrow na: A— PP°PA such that 

€pa(1 x na) = €ay: PAX A> 7D, (2) 
where + is the canonical map interchanging the factors PA and A. The 
counit €g: P°? PB — B for B in €°P is (np)°?. 


Proof: Just apply the usual calculation of the unit of an adjunction, 
by setting B = PA and following the identity arrow of PA through (1), 
using the formula (1.6) for the transpose. Both the triangular identitie= 
for the adjunction come down to the identity 


Pna°onpa = 1pa. 


In Sets, the definition (2) for a € A and S C A reads: S € naa 
if and only if a € S; in other words, a and S' are simply interchanged 
in the membership relation. The unit is thus the mapping of A to its 
double power-set given as 

naa={S|aeScA}e PPA. (3) 
This is related to the “Stone duality” for sets. 

Theorem 3. The power-set functor P: E°P — E is monadic. 

Proof: First, P is faithful. To each arrow h: B — A in € we 
construct first a monic (1,h): B— Bx A, next the characteristic map 
of this monic, and finally the P-transpose of that map, all as in the 
following diagram, where the squares are pullbacks: 


B—*_., 4A-—___> 1 


on] a,| | 


Bx Aq? Ax A— 2 (4) 


Aa tA Pe. 
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Hence Ph = Pk for arrows h, k: B — A implies that Pho {-}4 = 
Pk o {-}4 and hence that h = k. Thus P is faithful. (This argument 
amounts to checking the effect of Ph on singletons.) 

Now we apply Beck’s theorem as stated in the previous section. First 
note that €°P has coequalizers of reflexive pairs because they are just 
equalizers in €, and € has equalizers of all pairs. Next, because P is 
faithful, it must reflect both monics and epis. But, by Proposition 1.2, 
an arrow is an isomorphism if and only if it is both monic and epi. Hence 
P reflects isomorphisms. Finally, consider a coequalizer in €°P of some 
reflexive pair; this means that we have in € an equalizer diagram 


C-2+B==3A (5) 


and an arrow d: A — B with dh = dk = 1p (a “coreflexive pair” in €). 
We wish to prove that 


PA—"> PB—*1, PC (6) 
is a coequalizer (in €). But the commutative square 


C—+ +B 


in € is a pullback. For, if f, f’: D— Bhavehf =kf’, then dhf = dkf’ 
so that f = f’; since g is the equalizer in (5), there is a unique s: D 4C 
with f = gs = f'. Now g is monic and (by virtue of d) so are h and k. 
Proposition 3.2 and Corollary 3.3 (the Beck-Chevalley condition) then 
imply that the following diagrams commute: 


PB—?2-,PC  PC—*%+.PB  pA—_3*_, PB 


 ONUR ONG 


PA 3 PB, PC, PA. 


An easy calculation from these three equations shows that (6) is a co- 
equalizer; in more technical language, these equations for the two arrows 
dn, dg, PA — PB -<— PC backwards in (6) make (6) a “split fork” in 
the terminology of [CWM, p. 145], hence a coequalizer. Therefore P 
preserves coequalizers of reflexive pairs, and hence is monadic, by The- 
orem 4.2. 


Corollary 4. A topos € has all finite colimits. 
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Proof: Let T = PP°? be the monad defined in € by the power-set 
functor, and €7 the corresponding category of T-algebras. The forgetful 
functor €? — € creates limits. If J is any finite index category, € has 
all J°P-limits (all limits of functors on J°P to €). Therefore E7 also 
has all J°P-limits. But, since P is monadic, €°P is equivalent to ET, 
and equivalences of categories preserve all limits. Therefore €°P has all 
J°P-limits, so € has all J-colimits. This proves Corollary 4. 


By following through the steps of the proof above one can obtain 
a direct description of each particular colimit in €. We will not use 
this direct description, but its formulation will illuminate the argument. 
Thus given H: J°P — €°P, form the composite PH: J°P — E€ and take 
its limit ¢ in €, with limiting cone r: t —- PH. With the counit ey in 
€°P , there will be in € a unique arrow h to the limit so that the diagram 


prs | (in €) 
PPO? PH ime PH 
€H 
commutes. Now apply P°P here and form the diagram 
PEPPrt = Pt --------- wv) 
more |p lo (in EP) 


P°P Pex 
P°? PP? PH ——— PRP =a a, 


first choosing @ as the coequalizer (in €°P) of the pair displayed in the 
top row, then noting that the two composite natural transformations in 
the bottom row are equal and that both squares on the left commute, 
so that there must exist a unique cone o, given by the vertical arrow at 
the right, making the right-hand square commute. The assertion then 
is that o°P: H°P — € is the colimiting cocone for H°?: J > E. 

The proof that this is indeed a colimit is a direct translation of the 
proof of Theorem 3 and its corollary above. This translation would in- 
volve exactly the steps above without explicit mention of the T-algebras 
which we have used; in fact, however, the h constructed above does make 
t into a T-algebra (t,h), while @ is the value L(t, h) of the left adjoint to 
the comparison functor, and all the pertinent properties of such algebras 
reappear in the translated proof. The diagrams above may be made to 
look simpler by putting the second diagram into € (reversing the arrows) 
and writing P for P°P everywhere, as is the usual custom. 

For example, to get the coequalizer of f, g: b — a in €, first take the 
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equalizer + of Pf and Pg as in the right-hand column of the diagram 


P?q—P&4-+ Pa 


I ear 
P*b —=— Pb, 


use its property as an equalizer to construct the arrow hf in the top row 
above, and then form another equalizer @ as the top row in the diagram 


Now use the equalizing property of £ to construct o as in the left-hand 
column. This arrow o is then the desired coequalizer of f and g. 

The construction of the initial object 0 in € is briefer: Take the 
terminal object 1 of €; there is a unique arrow !: P?1 — 1, and 0 is the 


equalizer 
ePl 


Q ----- > PITS PAL. 


The reader may wish to convince himself that for € = Sets this con- 
struction really does produce the empty set! [Use the description (3) 
for €.] 


6. Factorization and Images 


In the category Sets, every function can be written as a surjection 
followed by an injection; i.e., as an epimorphism followed by a monomor- 
phism. Because we now have finite colimits in any topos, we can prove 
that this factorization holds in any topos: Every arrow factors as an epi 
followed by a monic. 

Call a monic m the image of the arrow f if f factors through m, say 
as f = me for some e, and if, whenever f factors through a monic h, 
so does m. This says in effect that m is the smallest subobject (of the 
codomain of f) through which f can factor. 
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Proposition 1. In a topos, every arrow f has an image m and 
factors as f = me, with e epi. 


Proof: Given f: A — B, construct the following commutative dia- 
gram in stages 


f 
i —— 
A ---5--> M ---5--»B ee e 
| i 
( 
A—,;—-N —= Bessrtecee 


First, take the cokernel pair x, y of f; this is a pair of arrows x, y from B 
to some object, universal with the property xf = yf. Hence, it can be 
obtained as a finite colimit; indeed, it can be described as the pushout 
of f with f. Let m, with domain M, be the equalizer of this pair z, 
y. Then zm = ym and m is monic; since ef = yf, the original f must 
factor through the equalizer m as f = me for some arrow e as displayed 
in the top row above. 

Now take any other factorization f = hg with h monic, as displayed 
in the second row of the diagram. By Proposition 1.1 for a topos, the 
monic hf is an equalizer, say the equalizer of the two arrows s and ¢ 
as displayed in the second row of the diagram. Then sh = th and, 
therefore, sf = tf; because x, y is the cokernel pair of f, there must 
be a (unique) arrow u, as displayed vertically, with s = ux and t = uy. 
Therefore, sm = uxrm = uym = tm, so that m must indeed factor 
through the equalizer A of s and ¢. This proves that m is the image of 
f, as asserted in the proposition; it remains to show that e is epi. First 
observe that when this image m is an isomorphism, f must be epi, for 
m is the equalizer of x and y, so m an isomorphism implies x = y, so 
that the cokernel pair of f is 2, x and f is therefore epi, as claimed. 

Now return to the factorization f = me and take the image m’ of 
the first factor e, so that f is a composite 


This means that f factors through the monic mm’; therefore so does its 
image m, say as m = mmv for some v. This implies that 1 = m!v, so 
the monic m’ is an isomorphism. As observed just above (in case of f) 
this means that e is epi, and the proof is complete. 


This factorization is functorial, in the usual sense: 

Proposition 2. If f = me and f’ = m'e! with m, m’ monic and e, 
e’ epi, then each map of the arrow f to the arrow f’ extends to a unique 
map of m, e tom’, e’. 
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Proof: A map of the arrow f to the arrow f’ is a pair of arrows 
r, t which make the resulting square on f and f’ commute, as in the 
diagram on the left below. Given such a pair of arrows and the two e-m 
factorizations, we must construct a unique arrow s from m to m’ which 
makes both squares in the diagram on the right below commute: 
f 


A—— +B A—*—> C —*— B 
Le 
r t r 8 P t 
Uae 
Ae A’ —_ C' ——_ B’. 
e m 


Suppose first that e and m are the maps found in Proposition 1, so that 
m is the image of f. Take the pullback P of ¢t along m’, as sketched at the 
right; here P — B is the pullback of a monic m’, hence is monic. By the 
commutativity of the left-hand square and the definition of a pullback, 
f must factor through P — B. Therefore, by the minimal property of 
the image, m must also factor through P — B, which implies that tm 
factors through C” via an arrow s, as tm = ms, as shown. Because m’ 
is monic, this arrow s is unique with this property. For the same reason, 
se = e’r, so the left-hand square of the rectangle above also commutes. 

In particular, given a second factorization f = e’m’ of the same 
arrow f, this argument, with r = t = 1, yields a unique arrow s which 
is both monic, because m’s = m, and epi, because se = e’, hence an 
isomorphism. This states that the factorization f = me, which we 
constructed, is unique “up to isomorphism”, so the construction of s 
above applies to any epi-monic factorization of f. 


Proposition 3. For each object A in a topos the partially ordered 
set Sub A of subobjects of A is a lattice. Moreover, for each arrow 
k: A — B, pullback along k is a morphism k~!: Sub B — Sub A of 
partially ordered sets, i.e., a functor; this functor k™! has as left adjoint 
the functor 3, which sends each subobject S of A to its image in B 
under k. 


Actually, the lattice Sub A is a Heyting algebra, as we will show in 
Theorem 8.1 after we have studied slice categories in §7. 


Proof: Given two subobjects S — A and T — A we can form their 
intersection as their greatest lower bound (g.l.b.) in Sub A simply by 
taking the pullback, as on the left below. 


sor —— aT S27. FT 
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To get their union or least upper bound (l.u.b.) in Sub A we first form 
the coproduct (sum) S + T in the topos, as in the right-hand square 
above; by the definition of a coproduct, the monics S — A and T— A 
then determine uniquely an arrow S +7 — A which need not be monic, 
but which by Proposition 1 has an image M as displayed. This M — A 
is then a subobject of A which clearly contains both given subobjects S 
and T. The minimal property defining the image M then readily shows 
that M is a least upper bound of S$ and T in Sub A, so M = SUT. 
Therefore, Sub A is a lattice (actually, a lattice with zero 0 — A and one 
A— A; the former is monic by Corollary 7.5 below). 

Next consider k: A — B. Since the pullback of a monic T— B 
along k is necessarily a monic S — A, and since pullback clearly carries 
inclusions of subobjects to inclusions, it is a morphism k~! of partially 
ordered sets. 

To construct the left adjoint 3, of k~, recall that for the topos 
Sets there was such a left adjoint sending each subobject S of A into 
its image under k (Theorem 1.9.2). Since we have images at hand, the 
same construction can be carried out in any topos: For each subobject 
u: SA, its image under k is the image of the composite arrow ku, 
hence a subobject of B written as m: 4,5 — B, with ku = me for some 
epi e. Put this, any subobject v: T— B and its pullback k~!T all in 
the diagram 


k B; 


the outer rectangle is commutative and the middle square is a pullback. 
From this we read off a correspondence 


g € Homa (3,S,T) + f € Homa(S,k7™ 7) 


as follows. Any map g of subobjects of B, as shown, has vg = m and 
hence vge = me = ku; since k~'T is a pullback, this determines a 
unique f which makes the diagram commute. Conversely, each map 
f: S 3 kT of subobjects of A yields by the diagram a factorization of 
ku through the monic v; since m is the image of ku, it too must factor 
through this monic v, say as m = vg for some g, necessarily unique. 
This shows that the correspondence g++ f above is a bijection—hence 
an adjunction, exactly as for € = Sets. 
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This completes the proof of Proposition 3. In case k: A > B is 
monic, this left adjoint 3, is exactly the direct image k; introduced 
below (3.7). In particular, for such a monic k, 


k713, =1: Sub A — Sub A. 


Since for k: A > B, the pullback functor 
k-1: Sub B > SubA 


has a left adjoint 3,, it follows that k~! preserves finite limits. In par- 
ticular, 
k-(SNT) =k (S)Nk7 (7) 

for any two subobjects S$, T of B. In other words, k~!; Sub B — Sub A 
is a homomorphism of “meet-semilattices”’. Another way of expressing 
this is by saying that the meet operation {): Sub(B) xSub(B) — Sub(B) 
is natural in B. Under the isomorphism Hom(B,2) © Sub(B), again 
natural in B, we thus obtain an operation A, making the following 
diagram commute: 


Sub(B) x Sub(B) ——1—- Sub(B) 


a 


Hom(B,) x Hom(B, 2) (2) 


a 


Hom(B, N x Q) —— ie Hom(B, Q). 


This operation A, is again natural in B, so by the Yoneda lemma (take 
B= xQ and apply (Az to the identity) A, comes from a uniquely 


determined map 
A: 28x29, (3) 


via composition. In other words, if the subobjects S and T of B have 


characteristic maps s and t: B = , then SMT has characteristic map 


(8,t) A 
B—+2x 2 — QO, written briefly as s At. One calls A in (3) the 


internal meet operation; it makes (Q, A, true: 1 — Q) into an internal 
meet semilattice object in the topos. (We’ll come back to this in §8 
below.) 

Similarly, for a fixed object B, the meet operation MN: Sub(B x X) x 
Sub(B x X) > Sub(B x X) is natural in X, so under Sub(B x X) = 
Hom(X, PB) one obtains an operation 


Ax 
Hom(X, PB x PB) = Hom(X, PB) x Hom(X, PB) —> Hom(X, PB), 
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again natural in X. By Yoneda again, this corresponds to a map 
A: PB x PB = PB, (4) 


which is the “internal meet” on PB. This internal meet is also natural 
in B, in the sense that for any k: A > B, the diagram 


PBx PB—A—> PB 
PPh) [Ps (5) 
PAx PA——>PA 


commutes (Exercise 8). 

Since any object has a unique arrow to the terminal object 1, an 
object S of € is a subobject of 1 precisely when the unique map S — 1 
is monic. Call an object U open in € whenever U — 1 is monic. In 
case € = Sh(Y) for some topological space Y, the open objects are 
those sheaves which correspond exactly to the open subsets of Y (see 
Proposition IT.2.4). 


Proposition 4. In a topos E the lattice Sub 1, regarded as a cate- 
gory, is equivalent to the full subcategory Open(E€) of all open objects 
of €. An object U is open in € if and only if there is for each object X 
at most one arrow X — U. 


As for the last sentence, there is for each X exactly one arrow X —> 1. 
Hence if U — 1 is monic, there can be at most one arrow X — U, while 
if an object U has this property for all X, the unique arrow U — 1 is 
necessarily monic. 

A subobject of 1 is an equivalence class of monics;, choosing one monic 
in each equivalence class yields a functor Sub(1) — Open(€) which is 
clearly an equivalence of categories; in fact Sub(1) is just the skeleton 
[CWM, p. 91] of the category Open(E). 

Similarly, for any object B of € a subobject S — B is just an open 
object in the slice category €/B so that the inclusion 


Sub(B) — Open(€/B) 


(which depends on a choice of a representative of each equivalence class 
of monics S — B) is an equivalence of categories which makes Sub(B) a 
full subcategory of E/B. 


Proposition 5. For any object B in a topos €, the inclusion 
i: Sub(B) > €/B has a left adjoint o which sends each f: A > B 
to its image. 
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Proof: For each object f: A — B of the slice category €/B take 
of to be the image m: MB, regarded as a subobject of B. By 
Proposition 1, whenever f factors through a monic h: C’ — B, so does 
m. This states exactly that Home/s(f,h) = Homguyay(™, h), hence 
that o defined by of = m is the required left adjoint to the inclusion 7. 


7. The Slice Category as a Topos 


For a fixed set B, regarded as a discrete category, there is an equiv- 
alence of categories (§1.1.9) 


Sets? = Sets/B. 


An object of the functor category on the left is a B-indexed family 
{ X, | b € B} of sets; the equivalence replaces this family by the function 
f: X =|][X, - B from the disjoint union of the sets X,. Since the 
functor category is a topos, so is the slice category of sets over B. This 
last result holds when Sets is replaced by any topos as in the following 
basic theorem: 


Theorem 1. For any object B in a topos €, the slice category E/B 
of objects over B is also a topos. 


Proof: Given two objects f: X — B and g: Y — B over B, the 
equalizer of two arrows X 3 Y in €/B is clearly just the equalizer in €, 
equipped with the evident map to B, while the product of f and g in 
€/B is just the pullback 


X pi Bey 


F |) (1 


i —— 8 


in € with projections p and g on its factors in €/B. The terminal object 
in €/B is the identity 1: B — B. Since a subobject of X — Bin €/B 
is (essentially) just a subobject of X in €, the subobject classifier 1 in € 
yields at once a subobject classifier 2 x B — B, with arrow the second 
projection, in €/B. Hence €/B has all finite limits and a subobject 
classifier, so it remains only to prove that it has power objects—and, 
therefore, exponentials, by Theorem 2.1. 

Given two objects f: C — B and g: D > B in €/B, we wish to 
construct an object Pgf, the power-object of f, so that for the hom- 
sets Homg in €/B one has a bijection 


Homa(C xz D,Q x B) Y= Homa(D, Psf) (2) 
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which is natural in C' and D. To this end, consider the fiber product 
C xg D as a subobject of C x D in €, and compute its characteristic 
map ¢: Cx D— 0) by the following commutative diagram in €: 


C xg D > B —____—_ 1 


| (i) [+e (ii) fom 


fxg 6B 
(iii) | 
Cx pcx BAB. BUS Bx PB. (3) 


fx 1 (iv) Jee 


1 C x B——a tC x PB app © x PC 


(v) 


CO x D Cx PC. 
xw 


Here the top left rectangle, numbered (i), is just an expression of the 
definition of C' xg D as the pullback of C —+ B — D, while the top 
right rectangle (ii) is a pullback by the definition of the Kronecker delta 
6g as the characteristic function of the diagonal Ag. Thus the whole 
top rectangle is a pullback, so the characteristic map ¢ sought is the 
composite ¢ = da(f xg). The rest of the (commutative) diagram simply 
calculates this map to be 


Ec(1 x w): Cx DQ, where w= Pfo{-}30g: D— PC. (4) 


Indeed, the flat rectangle (iii) at the right is just the definition (1.13) 
of {-}, as the P-transpose of 6g, the square (iv) on the right is the 
definition (1.7) of the action of the functor P on an arrow f (€ is dinat- 
ural), and the region (v) lower left exhibits the formula (4) above which 
defines w. 

With this w we now analyse the left-hand hom-set in the desired 
bijection (2): 


Homa(C xg D,Q x B) = Home(C xg D,Q) 


& Sube(C xg D) 
© {S| S © Sube(C x D) and SC CxpD}; 


here each subobject S of C x D can be interpreted as an arrow (char- 
acteristic map) A in the lattice Home(C x D,(). Using the intersection 
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operator of that lattice [see (2) and (3) of §6 above] the characteristic 
map (4) gives a further bijection 


2{h|h: Cx D>Mand kA Ec(1 x w) = hh}; 
the P-transpose to the lattice Home(D, PC) then gives 
= {k|k: D— PC and kAw=k}. 


Here we express the intersection operation by the intersection arrow 
A: PC x PC — PC [see (4) of §6] and use the definition w = Pf{-}ag 
of (4) to write 


kw = \o(k, Pf o{-}e0g) =Ao(1x Pf{-}s)o(k,g) =t0(k,g), 


where t = Ao(1 x Pf{-}g) as in the square displayed below. Thus 
we have a further bijection to the set of those arrows k such that 
t(k,9g) =p(k,g); i.e., such that (k, 9) equalizes the top parallel pair in 
the commutative diagram, with p the first projection, 


p22), pC x B= PC 


; 4 t 
1 ve pp ‘he [, (5) 


Therefore, we define the object Pg f with the arrow e as shown to be 
the equalizer of p and ¢ in this diagram. If we regard Pgf as an object 
over B (via e and projection on B) this finally gives a bijection 


Hom ,(C XB D,Q x B) = Homa(D, Psf). 


In other words, Pg f — B is the desired power object for f: C — Bin 
€/B, so that E/B is indeed a topos, as required. This completes the 
proof of Theorem 1. 

In the topos € = Sets, an element of the set Pg f described here is 
just a pair S, b with Sc C,b€ Band S c f~—'{b}; in other words, it is 
just a subset S of one of the fibers of f: C — B. More to the point, in 
€/B a global element of Pg f — B is a map to this object from B — B; 
such a map sends each b € B into a set S(b) in the fiber f~!{b} over }; 
it thus describes a subset T C Pa f in terms of its fibers S(6). 

Now consider any arrow k: B — A in a topos €. Pullback along k 
then turns each object f: X — A of E/A into an object f’ of €/B: 


X! ------+ >X 
| (6) 


{ 


f' 
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and so defines a change-of-base functor (or, pullback functor) 
k*: E/A > E/B. 
Theorem 2. For any k: B — A in a topos E, the change-of-base 
functor k*: €/A — €/B has both a left adjoint =, given by composition 


with k, and a right adjoint T,. Moreover k* preserves the subobject 
classifier and exponentials, and hence is a logical morphism. 


Here, as in §2, a logical morphism L: € + F between topoi is a func- 
tor which preserves finite limits, subobject classifiers, and exponentials. 
(As always, “preserves” means “preserves up to isomorphism” .) 

Proof: Since the topos €/B is now known to have exponentials, 
we can apply Theorem 1.9.4, valid for cartesian closed categories, to 
conclude that &* has both adjoints, as stated. Since it has a left adjoint, 
it must preserve all existing limits, in particular all finite limits. Clearly 
k* also carries the subobject classifier A x2 — A of €/A to that of €/B. 
To show that k* is a logical morphism (and hence that it preserves all 
topos concepts) it remains only to prove that it preserves exponentials. 
For objects g, f in €/A, this means that there is a natural isomorphism 


f 
k*(Y¥ ae Am) ~ [k*(Y i AP AA), 
If we write f’ for the pullback k* f, just as in (6) above, this isomorphism 


amounts to the commutativity up to isomorphism of the functors k* and 
( )f displayed in the following square 


f 
E/AS SEA 


XXA 
“fol 
( 


E/B <—— €/B 


Now each functor k* or (_ )/ [respectively ( )‘’] has a left adjoint 
Xi, or X x4 — (respectively X’ xg —). By the uniqueness (up to 
isomorphism) of left adjoints, it will therefore suffice to prove that the 
square of left adjoints here commutes up to isomorphism. But for an 
arbitrary (Y’ — B) in €/B, this isomorphism 


X X4 Uk(Y' > B) = Ly(X' xp (Y' — B)) 


follows at once from the definition of £; as composition with k and the 
definition of the pullbacks involved. 


From this proposition, we can deduce for any topos several conve- 
nient properties which are evidently true in the topos € = Sets. 
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Proposition 3. In a topos, the pullback of an epi is epi. 


Proof: An arrow e: X — A is epi if and only if the square 


xX —*— A 


is a pushout in € or, equivalently in €/A; cf. the remarks after I.9.(9). 
Since the operation k* of pullback along k: B — A has a right adjoint, 
it preserves all colimits; in particular, all pushouts; hence, it preserves 
epis. 


Proposition 4. In a topos €, any arrow k: A — 0 is an isomor- 
phism. 


Because of this property, one also says that 0 is a “strict” initial 
object. 

Proof: In a slice category €/B, the unique arrow 0 — B is clearly 
the initial object, while the identity 1: B — B is the final (i.e., terminal) 
object. In €/0 the identity 15: 0 — 0 is, therefore, both initial and final. 
Since pullback along the given k : A — 0 has both adjoints, the pullback 
g of 1p along & must thus be both initial and final in €/A. This means 
that we can write g as an arrow from 0 and find an isomorphism ¢ as in 
the following diagrams: 


Thus g = t is an isomorphism, so kg = 1 makes k = t~! also an isomor- 
phism, and the proposition is proved. 


Corollary 5. Every arrow 0 — B from 0 in a topos € is monic. 


Proof: If there are two arrows hi, he: X — 0, they are both 
isomorphisms, so their inverses h,~! and h2~! must both be the unique 
arrow 0 — X, so hy = hg and 0 — B is monic. 

It follows that 0 — B is the minimal subobject of B for any object 
Bin €. 

In set theory, the coproduct of two sets is often described as their 
“disjoint union”. Here is the analogous statement for a general topos: 
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Proposition 6. If S and T are disjoint subobjects of B (i.e., if 
SMT =0) then the join SUT in B is also their coproduct S + T. 


Proof: Let h: SB and k: TB be the inclusions of the given 
two subobjects. Their coproduct S + T in € is also their coproduct 
in €/B. The pullback functor k*: €/B — €/T is a left adjoint, hence 
preserves the coproduct (h,k): S+T — B in €/B. Now the pullback 
k*k is the identity, while the hypothesis of disjointness means that the 
pullback k*h is 0. Hence, the pullback of S +T — B along k is the 
identity T + T: 


Symmetrically, this means that the pullback along (h,k) turns k into 
the inclusion ig: T— S +T of the coproduct; by the same argument it 
turns A into the first inclusion 7}: S => S+T. Since the pullback functor 
(h,k)* preserves coproducts, the pullback of S+T — B along itself is 
the identity 


S+T——>S+T 


| | 


S+T ——— B. 


This in turn means that S+T — B is mono; hence, S+T is a subobject 
of B. So, by the definition of the join S UT below Proposition 6.3, the 
join is S+T, q.e.d. 


We note that this proof uses essentially the presence of adjoints to 
pullback. As a converse to Proposition 6 we also remark that any two 
objects S and T of a topos € are disjoint subobjects of their coproduct 
S+T in E (see Corollary 10.5 below). 

In §1X.6 we need the following infinite version of this result: 


Proposition 6 [bis]. Consider a family m;: S; — B of pairwise dis- 
joint subobjects of B. If their coproduct |] S; exists, the induced map 
m: [| S, — B is again mono, and so represents the supremum of the 
subobjects S;. 
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Proof: For each pair of indices 7, j7, the meet 5,S,; appears as the 
pullback 


Si n$,-———> 3; 


Le 


5 B. 


Since pulling back along a fixed m, preserves coproducts, this yields 
another pullback diagram 


[][s:95; ——][[s; 


jel gel 
Sj rn, B. 


Next, pulling back along m also preserves coproducts, so the preceding 
pullbacks summed over all i provide yet another pullback 


LI] ][s..s; —— | ]s; 
2€197E1 gel 


[[s. i B. 


1€1 


But, by assumption $,9.S, = 0 for distinct indices 7 and j, while S,NS, = 
S,. Therefore, the last pullback is really 


[[ss—"- Ls 


kel gel 

| [> 
=e 
ier 


This pullback implies that m is mono, as required. 


Proposition 7. In a topos, if f: X + Y and g: W — Z are epi- 
morphisms, then so is f xX g: X x W > Y x Z. 
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Proof: The map f x g can be written as the composite of the maps 
fx1l:XxW—YxW andlxg:Y xW —Y x Z. By Proposition 3, 
these maps are epi since the following two squares are pullbacks 


XxW—— Xx Yxw—?—W 
pa [ | | 
YK Wey, Y x Z—zz Z. 


Thus, their composite f x g is epi. 
Recall that the kernel pair of a map f: A — B is the pair of projec- 
tions to A from the pullback of f along itself, as in 


Ag ASA. 


Theorem 8. In a topos, every epimorphism is the coequalizer of its 
kernel pair. 


Proof: Fix a topos €. To begin with, let A be any object of €, and 
consider the coequalizer Q of the projections of A x A, as in 


Ax A= A—44Q. 


We claim that the unique map Q — 1 is monic. Indeed, consider two 
diagrams 
AxA—“45QxQ 


“| Ir i=1,2. 


A= Q 


Now m4 (q xq) = qm = 9m = %5(q x q), and q xq is epi by Proposition 7 
above, so 7 = 74. Consequently, if f and g: X — Q are two maps from 
an arbitrary object X into Q, then f = m{(f,g) = ™(f,g) = g. It 
follows that Q — 1 is monic, as claimed. 

Now suppose in addition that A — 1 is epi. Then Q — 1 must be 
epi as well, so Q & 1 and it follows that 


Ax A= A—> 1 


is a coequalizer diagram. 

If f: C — B is an arbitrary epimorphism in €, then we can apply 
the preceding argument to the object A = (f: C > B) of the topos 
€/B. That f is epi now means that A — 1 is epi in €/B. Hence, as just 
shown, A x A=3= A— 1 is a coequalizer in €/B. But this simply means 
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that C xp C3 C — B is a coequalizer in €. Thus f is the coequalizer 
of its kernel pair, and the theorem is proved. 


8. Lattice and Heyting Algebra Objects in a Topos 


Let C be acategory with finite limits. A lattice object, or an internal 
lattice, in C is an object L of C together with two arrows 


A: LxL—ol, V:ExLot, (1) 


called meet and join, which render commutative the diagrams which 
express the identities used in the equational definition of a lattice (§I.7): 
associative, commutative and idempotent laws for both A and \/ plus 
the absorption law 


t\(yV2) = x = (xAy)Vz- (2) 


For example, the absorption law is expressed by the commutative dia- 
gram 


i a LxL 
Pp [xv 
Lx LO Lx Lx LL x Lx (3) 
xl xT 
Pp ae 
L— Y, LxL 


in which p is the projection of the product on its first factor, 6: LD + LxL 
is the diagonal, and 7: L x L + L x L is the twist map interchanging the 
factors of the product; we have written the diagram as if the product 
were associative; that is, we have omitted from the middle of the diagram 
the canonical isomorphism Lx(Lx L) = (ZxL)xL. The upper rectangle 
in (3) corresponds to the left-hand side of the equation in (2), and the 
lower rectangle to the right-hand one. 

Moreover, such a lattice object L has a zero and a one (or a bottom 
element | and a top element T) when there are arrows 


T:1-8, L:1lad (4) 
from the terminal object 1 of C which satisfy the appropriate identities 


zVL=a, sAT =a, 
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that is, which make both composites 


|G De a ats ey ees 


bebe toe ep ete y 
the identity. 

Furthermore, L is called a Heyting algebra object in C or an in- 
ternal Heyting algebra, if there exists an additional binary operation 
=>:LxL— L (“implication”) satisfying the diagrammatic version of 
the identities of Proposition 1.8.3. 

If L is an internal lattice, one may define the corresponding partial 
order relation on L, according to the familiar formula which expresses 
the order relation by an equation: 


e<y iffeAy=c. 
Thus, we define a subobject <, of L x L as the equalizer 
e A 
Sp ~oLx Los. (5) 


The fact that (Z,<,) is an internal partial order (or internal poset) can 
be expressed by appropriate diagrammatic versions of the usual reflex- 
ivity, transitivity, and antisymmetry laws. To say that <x is reflexive 
means that the diagonal factors through <,, as in 


Ls Lx 
% 
“Se (6) 


<1. 


Define the subobject >; — L x L as the one represented by the com- 


posite <<, ~LxL—oLxL [with 7 as in (3)|. Then the antisymmetry 
of <; can be expressed by saying that the intersection <; N >1 of 
subobjects is contained in the diagonal, as in the pullback 


<n N 2p <1 
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Finally, transitivity of <; can be expressed by saying that the subobject 
(mev, meu): CL x L factors through <r, Lx L, where C' is 
defined as the following pullback, with projections u and v: 


CS eee 


<b ee LL 


[So if the ambient category C is Sets, then C is simply the set of triples 
(x,y, 2) with x < y and y < z, while u(z,y, z) = (y,z) and v(z,y, z) = 
(x,y). 

a of the internal relation <; on any internal lattice D of a 
topos € one can now define an internal Heyting algebra of € to be an 
internal lattice of € with an additional binary operation =: L x L - L 
such that the two subobjects P and Q of L x L x L defined by the two 
pullback squares below are equivalent subobjects: 


J ‘ 
' ( 
( e ( 
v wa 


LxLxlL— Jol x lege lx lx. 


This equivalence is a diagrammatic formulation of the definition of im- 
plication by aAb < ¢ iff a < (6 > c). This second definition of an 
internal Heyting algebra object is equivalent to the previous definition 
in terms of identities on =; the proof of this equivalence applies the 
Yoneda processes to the related functorial structures in the hom-sets 
Hom(X,L). The proof uses essentially the fact that the relation < is 
defined by equations, as in (5); for details, see Exercise 4. 

A homomorphism of lattices (or of Heyting algebra objects) L — L’ 
in € is an arrow f: L > L’ of € which commutes with all the operations 
involved; i.e., the diagram 


aT A 


1 ——— L +—*—__ Lx L 


ft he 


U U 
Pad —x xt 


commutes and similarly with T replaced by L and A by V (or by =). 
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Now recall that for an object A of a topos €, the set of subobjects 


Sub¢(A) has the structure of a lattice (in Sets), with O— A and A es A 
as bottom and top, and with meet and join as described in Proposi- 
tion 6.3 above. In fact, Subg(A) is a Heyting algebra. The reason is 
simply that the exponential UY of two open objects U1 and V1 
is again open, i.e., UY — 1 is monic, as is obvious from the universal 
property of the exponential. So the lattice Sube(1) has exponentials, 
hence is a Heyting algebra. Since for an arbitrary object A of a topos E, 
Subg(A) = Subg,,(1), it follows that Subg(A) is also a Heyting algebra, 
with as implication operator the exponential in €/A. 

Suppose k: A — B is a morphism in €, and consider the commuta- 
tive square 

Subg(B) —!——+ Subg(A) 


| I (10) 


E/B ——,—> E/A 
where i4: Sube(A) — €/B is the obvious inclusion (which identifies 
subobjects of A in € with subobjects of 1 in €/A). Since k* preserves 
exponentials, sums, and epimorphisms (Theorem 7.2), it follows from 
the description of the lattice structure of Subg(B) and Subg(A) just 
given that k—} is a homomorphism of Heyting algebras. In other words, 
we have proven: 


Theorem 1 (External). For any object A in a topos €, the poset 
Sub A of subobjects of A has the structure of a Heyting algebra. This 
structure is natural in A in the sense that the pullback along any mor- 
phism k: A — B induces a map k~' of Heyting algebras as in (10). 

There is a corresponding “internal” result: 

Theorem 1 (Internal). For any object A in a topos €, the power 
object PA is an internal Heyting algebra. (In particular, so is the sub- 
object classifier 2 = P1.) Moreover, this structure is natural in A, 
in the sense that, for a morphism k: A — B in €, the induced map 
Pk: PB — PA is a homomorphism of internal Heyting algebras. For 
each X in € the internal structure on PA makes Hom(X, PA) an exter- 
nal Heyting algebra so that the canonical isomorphism 


Subg (A x X)& Homeg(X, PA) 
is an isomorphism of external Heyting algebras. 


Proof: We have already indicated part of the proof in §6 [see (3) 
and (4) of that section], where we defined a meet, 


A: PAx PA PA, 
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in such a way that for any X € €, the meet operation on Home (X, PA) 
induced by composition corresponds to meet in the lattice Sube(A x X) 
under the canonical isomorphism Subg(A x X) & Home(X, PA). The 
other operations can be defined in exactly the same way. For example, 
for each X in €, Subg(AxX) has an implication operation >: Subg(Ax 
X) x Sube(A x X) — Subg(A x X), which is natural in X, as pointed 
out just before the statement of Theorem 1. Hence, there is a unique 
operation =x, again natural in X, such that 


Subg(A x X) x Subg(A x X) ——> Sube(A x X) 


a 


Home(X, PA) x Homeg(X, PA) Q (11) 


a 


Home (X, PAx PA) —sxz— Hom, (X, PA) 


commutes. By naturality of >x in X, the latter operation must then 
be induced—via composition—by a uniquely determined map >: PA x 
PA-— PA (this is an application of the Yoneda lemma). 

Top and bottom objects T and L are treated similarly. The fact 
that Pk: PB — PA is a homomorphism of Heyting algebras follows 
from the naturality of the Heyting algebra structure on Subg(A), by 
commutativity of the diagram 


Subg(B x X) ——— Hom¢(X, PB) 
(kx | frames (12) 
Sube(A x X) > Home (X, PA). 


Indeed, since (k x 1)~! is a homomorphism of Heyting algebras, so is 
Home (X, Pk) for each X in €. But then Pk must be a homomorphism of 
internal Heyting algebras, as follows easily by unwinding the definitions. 
This proves Theorem 1. 

Note especially that the proof uses the definition of internal Heyting 
algebras by the operation => and not that by the binary relation <,. 

To conclude this section, let us remark that the internal Heyting 
algebra structure on 2 is by definition the unique one such that 


Sube(X) —~> Home (X, 2) (13) 
is an isomorphism of Heyting algebras. So for two subobjects S and T 


: (s,t) 
of X with characteristic maps s and t: X — ©, the composite X ——, 
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axa — Q, denoted briefly by s > t: X — Q, is the characteristic 
map for S = T. And similarly, s A t, s V¢ are characteristic maps for 
SAT, SVT, respectively. Moreover, if S € Subg(X) is a subobject 
of X classified by o: X — 2, then =S € Subg(X) is classified by the 


composition X OG: [Note that the “” on —S is an operation 
in the Heyting algebra Sube(X), while the ““” in X 4 2 > / is the 
operation of the internal Heyting algebra structure of 0] 

The top and bottom elements of the Heyting algebra Subg(1) are 


the subobjects 1 —+ 1 and 04 1, respectively. Under the isomorphism 


Sube(1) = Home(1,), as in (13), these correspond to the top and 
bottom elements “true” and “false” of the Heyting algebra Home (1, {), 


so that 
| fs and | ee (1 4) 


true , 1 false Q, 


are both pullbacks (the first trivially so, the second by definition of 
“false” ). In any Heyting algebra, the negation interchanges the top and 
bottom elements, as in 


0=1, -71=0. 


When one transposes these identities along the isomorphism Sub¢(1) = 
Home¢(1,), one obtains commutative diagrams 


1 false 1 true 


QD QD 
aK I a I (15) 
QD Q. 


Notice that, in fact, the square 


— | 


ae Ie ( 1 6) 


Q —.——> 2 


is a pullback. Indeed, in order to verify this, take any object T in € 
and an arrow 0: T — 2 such that soo = trueo!, where !: T > 1 
is the unique arrow into the terminal object. If ST is the subobject 
classified by 0: T — ©, then by the identity sog = trueo!, the subobject 


(S) > T is the maximal subobject T — T of T. But then 0 = SA~S = 
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SAT = S, so that the given arrow 0: T — 2 in fact classifies the 
subobject 0 T. But in the diagram 


0 ——_— 0 ———+ 1 


aa: 


! ‘alse 


the outer rectangle is a pullback because the square on the right is a 
pullback by the definition of “false”, while the square on the left is 
evidently a pullback. Thus, falseo! also classifies 07, and hence 
ao = falseo!. This shows that (16) is a pullback. We will use this result 
in §VI.1. 


9. The Beck—Chevalley Condition 


Throughout this section, we work with a fixed elementary topos €. 

Let f: A — B be a morphism in €. In Proposition 6.3 we proved 
that f~1: Sub(B) — Sub(A) has a left adjoint 3;: Sub(A) — Sub(B). 
Recall that for a subobject UA of A, 4;(U) > B is the subobject 
of B determined by factoring the composite UA — B as an epi 
followed by a mono: U -» 3;U — B, so that 4;U is the f-image of U. 
In Theorem 8.1 (external), we observed that f~!: Sub(B) — Sub(A) is 
a homomorphism of Heyting algebras. In particular, 


fU\U =Vv)=f"U) > f'(V) (1) 


for any two subobjects U and V of B. Hence, for an arbitrary subobject 
W of A, 


a(W)AU <V_ iff 3,(W)< (USV) (cf. def >) 
if W< fi >V) (by 3; 4 f7?) 
if W<f"'\(U)>f (Vv) [by ())) 
if WAf(U)<f(V) (by def >) 
iff AsWAf(U)) <V (by 4y 4 f~*). 


Since this holds for any V € Sub(A), one obtains the so-called Frobenius 
identity, or projection formula for f: A> B,W Cc AandU CB: 


4y(W)AU = 3;s(WAf-*(U)). (2) 


(For a related argument, see Exercise 2.) 
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Now suppose we are given a pullback diagram 


Cx,B—"_—B 


[TP 


C= se. 


Consider a subobject U — B, as well as the induced subobject p~!(U) = 
Cx,4U-—C x, B. To compute 37U, one factors UB — A as an 
epi followed by a mono, as U-» 3,U — A. Since pulling back along g 
preserves mono’s as well as epi’s (cf. Theorem 7.2, Proposition 7.3), it 
follows that p~!(U) — g~14,U — C is again an epi-mono factorization. 
This may be pictured as in the following diagram, in which the front, 
back, bottom, and top faces are all pullbacks 


pe) ae ee 
ef 4a,U 
Cx,B B 
IN \ 


Cc ——,~—— A. 


In other words, 3,p~1(U) = g~1Ss(U). Thus we have proved 
Proposition 1. For a pullback square (3), the diagram 


Sub(C x4 B) <2 sub(B) 


Ap 
[a 
4 


Sub(C) ——=—— Sub(A) 
satisfies the Beck-Chevalley condition: for any subobject U of B, 
Gg *A;U = 3p U. 


By symmetry, the identity 3,q7-1V = f ~13,4V for subobjects V of C 
holds as well. 

We now wish to derive an “internal” version of the existence of the 
left adjoint 4, for f~'. Let (P,<) and (P’,<’) be internal partially 
ordered objects in € (§8), and let 

g 


PoP 
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be order-preserving maps in € (ie., the composition < —P x 


oxo 
P — P' x P' factors through <’ — P’ x P’, and similarly for ~). 
We say that ¢ is internally left adjoint to w if (¢y, 1p.) and (1p, ¢) 
factor through <’ and <, respectively: 


P'. (ov, 1p, ) P' x P' Be (lp, vd) P x P 
PPS Oey 
<i < * 


[These are just the diagrammatic versions of the conditions familiar in 
the case € = Sets: ¢ is left adjoint to ~ when ¢y(p’) < p’ for all p’ € P’ 
and p < w¢(p) for all p € P, since these inclusions give respectively the 
counit and the unit for the adjunction and the triangular identities then 
follow formally.| 

The internal adjunction between ¢ and 7) may also be expressed as 
follows: for any object X of E, Home(X, P) is a partially ordered set 
when for f, g: X — P one sets 


f<g iff (f,g): X — Px P factors through < — P x P, 


and similarly for Homg(X,P’). Then @ is internally left adjoint to 
# as above iff for each object X of €, the order-preserving map 
og»: Homg(X,P) — Home(X, P’) induced by ¢ via composition is left 
adjoint to the similarly induced map 7.: 


Home (X, P) = Home(X,P’), $+ 44. 


Theorem 2. Let f: A— B bea map in €. Then Pf: PB + PA 
has an internal left adjoint 37: PA — PB. Moreover, the internal 
projection formula (2) holds, i.e., the diagram 


PAx PB aga) PBx PB 
vPs| I" (5) 
PAx PA—;—+ PA; > PB 


commutes. And the internal Beck-Chevalley condition holds, i.e., for 
any pullback square of the form (3) above, the square 


P(C xa B)~~2~— PB 


commutes. Pg 
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Proof: The proof is just a matter of observing that the correspond- 
ing external statements hold in a natural way. More precisely, consider 
for an object X of € the external left adjoint 4,1) as in 


Ace x1 
Sube(A x X) ZS Subs (B x X). (7) 
(fx1)7* 


Clearly (f x 1)~ is natural in X, but so is (7,1), i-e., for any h: Y + X 
the square 
Sube(A x X) £22, Sube(B x X) 


xn | fox 


commutes. [This is a special case of the Beck-Chevalley condition of 
Proposition 1, applied to the pullback square 


AxX—2..BxX 


| | 


AxY—a- BxyY)] 


Consequently, the unique maps (4f)x and (Pf)x obtained from the 
natural isomorphism Sub(— x ?) = Hom(?, P — ) as in 


4 xl 
Sube(A x X) = 25 sube(B x X) 
(Fxay? 
(Af) x 


Home (X, PA) ———— Home (X, PB) 
(Pf)x 


are again natural in X. Hence by the Yoneda lemma, they must be 
induced by uniquely determined maps 


4 
PA =e PB. (9) 


(This gives the same Pf as defined before.) Since (7) is an adjunction, 
so is (3s)x 4 (Pf)x for every X. Therefore, as explained above, 3; is 
an internal left adjoint for Pf. 
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The internal Beck-Chevalley condition is derived in a similar way, 
by observing that the corresponding external condition holds, natural in 
a parameter object X. More explicitly, for any object X, we take the 
product of the pullback (3) with X and obtain a pullback 


(Cx, B)x X22 BxX 


| |r (10) 


Cx X ——— Ax X 
gXx1 


The external Beck-Chevalley condition (Proposition 1) for this square 
says that for any subobject U of B x X, 


(gx 1)7*3¢x1)(U) = A(qx1) (Pp Xx 1)~*(U). 

Passing to hom-sets via Sub(A x X) & Home¢(X, PA), as in (8), this 
means that for any u: X — PB, Pgo3;ou = (Pg)x(Af)x(u) = 
(av)x (Pp)x(u) = 4g 0 Ppo u. Commutation of (6) in the theorem thus 
follows by taking u to be the identity on PB. 

The proof of the internal projection formula is analogous, and is left 
as Exercise 9. 

Summarizing, given f: A + B we now have adjoint functors (with 
oa and ig defined as in Proposition 6.5 and Ly, composition with f, as 
in Theorem 7.2): 


Apap 


val le o i, (11) 


i pF ae a 


and by construction, f*ig =i4f~) and Opus = Ayo. Moreover, when 
A and B are replaced by A x X and B x X, these adjunctions hold 
naturally in the parameter object X, and therefore one also obtains 
internal adjoints PA PB, as in Theorem 2. 

Now recall that f*: E/B — €/A also has a right adjoint II, (cf. The- 
orem 7.2). Since a right adjoint functor automatically preserves the ter- 
minal object as well as monomorphisms, IIy restricts to subobjects of 
the terminal object. In other words 


Proposition 3. For f: A — B in €, the functor f-!: SubB > 
Sub A has a right adjoint V;, such that 


Sub A —!—> Sub B 


ul [= (12) 


E/A———> £/B 
commutes. / Ils / 
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Special cases of the existence of this right adjoint have already been 
considered in §I.9 (quantifiers as adjoints) and §III.8.(16). 

Notice that these right adjoints satisfy again a version of the Beck- 
Chevalley condition: for a pullback (3), the diagram 


Sub(C x4 B) —2-+ Sub(B) 


a 


Sub(C’) 3 Sub(A) 
commutes. Indeed, commutativity of (13) is equivalent to commutativity 
of the diagram obtained by replacing all arrows in (13) by their left 
adjoints. The diagram thus obtained commutes by the Beck-Chevalley 
condition of Proposition 1 above. 

As for the left adjoints 47, one can deduce an internal version of 
the preceding proposition, by observing that it holds naturally in a pa- 
rameter object X. To see this, fix f: A — B and consider a map of 
parameter objects h: Y — X. Then the square 


Sub(A x X) OXY , Sub(B x X) 


xn| cos 


Sub(A x Y) Tae Sub(B x Y) 


commutes, as a special case of the Beck-Chevalley condition (13) above. 
Consequently, the map (Vf)x, defined by requiring the square 


Sub(A x X) 0), sub(B x X) 


to commute, is natural in X. Hence it comes from a unique map 
Vy: PA — PB, by composition. Analogous to Theorem 2, one thus 
has: 


Proposition 4. For any f: A — B in €, the map Pf: PB + PA 
has an internal right adjoint Vs: PA — PB. 


As before, one can deduce an internal version of the Beck-Chevalley 
condition (13). We leave the proof as Exercise 9. 


210 IV. First Properties of Elementary Topoi 


10. Injective Objects 


In any category C, an object K is said to be injective when for 
every monomorphism m: S.— B in C every f: S — K can be extended 
tog: B— K with gm = f, as in the diagram 


S samme: 


WL 


K. 


a 


9 (1) 


Equivalently, this states that for each monomorphism m: S — B the 
induced map 


m*: Homc(B, K) > Homc(S, K) (2) 


is onto. In homological algebra, the injective objects in categories of 
modules play a special role; since every module M can be embedded in 
an injective module, repeated embeddings give an injective resolution (a 
long exact sequence) from M. From these resolutions one may calculate 
the derived functors of M. 


Proposition 1. In any topos €, the subobject classifier 0 is injec- 
tive. 


Proof: Any arrow f: S—1 (a “property” of S), as on the left in 
(1), is the characteristic function of some subobject T — S of S. But this 
T is also a subobject T — S — B of B and as such has a characteristic 
function g: B — ©. This means that the right-hand square in the 
diagram 


T =—— T—— 1 


a: 


Da Be 


is a pullback. Since m is monic, the left-hand square, and therefore the 
whole rectangle, is also a pullback. Thus, the composite gm must be 
the characteristic function of T — S; i.e., must equal the given arrow f, 
as gm = f. This gives the factorization required in (1). 


Corollary 2. For any object C in a topos, PC is injective. 
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Proof: For each S — B, the definition of P yields the commutative 
diagram 


Hom(B, PC) ————> Hom(S, PC) 


~ | 


Hom(C x X, 2) ———> Hom(C x S,). 


~ 


By the Proposition, the bottom map is onto; hence so is the top map. 


Corollary 3. Any object C in a topos has a monomorphism to an 
injective object. 


Proof: By Corollary 2, with Lemma 1.1, the singleton map 
{-}e: C — PC is such. 

This property is reminiscent of the fact that over any ring, modules 
may be embedded in injective modules. 


Corollary 4. Ina topos, the pushout of a monic along an arbitrary 
map is again a monic. Moreover, pushout squares of this form are also 
pullback squares. 


f 
Proof: Given a diagram B 520 with m monic, one may form 
the pushout Q (by Corollary 5.4), and also embed C in an injective, by 
Corollary 3, above. This yields a diagram 


S ———+C 


Since PC is injective, {-}c o f extends to B, to give a map g: B— PC 
making the outer square commute, as shown. Since Q is the pushout, 
there is a unique h: Q — PC with {-}c = hm’ and hf’ = g. Since 
{-}¢ = hm’ is monic, so is m’. This proves the first part of the Corollary. 

For the second part, we must show that the square with vertex Q is 
also a pullback. It suffices to prove that the outer (distorted) square with 
vertex PC is a pullback. To this end, we use the special way in which 
the extension g can be chosen, according to the proof of Proposition 1 
and Corollary 3 above. Indeed, {-}¢: C + PC is the transpose of the 
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classifier C x C' — 0 of the diagonal map 6: C' > C’x C, by construction. 
So {-}¢ of: S — PC corresponds under Hom(S, PC’) = Sub(C x S) to 
the subobject of C x S obtained by pulling back 6 along 1x f: Cx S— 
CxC. But j 


S$ ———C 


i r 


Cx S——>CxC 


\xf 


(f:1) 
is a pullback. So {-}¢ of corresponds to the subobject S ——> C'x S of 
Cx S. The map g: B — PC then can be chosen to correspond to the 
subobject of C x B represented by the composite 


SPY x SEE x B. 


Now suppose we are given X “B and X —+C such that {-}cou= 
gou. Under Hom(X, PC) & Sub(C x X), the map {-}c¢ ov corresponds 
to (v, 1): XC x S (just as in the case of f: S + C just considered), 
while g o u corresponds to the pullback along 1 x u of the subobject 
(f,m) corresponding to g; i-e., to V as in the pullback 


But if {-}cov=gou, then VC x X and (v, 1): X + C x X must 
be the same subobjects, so X factors through S' as in 


Cx X— > Cx B. 


Then mw = u and fw = v, and w is unique with this property since 
m is monic. This proves that the outer square of (3) is a pullback, as 
required for the corollary. 


Corollary 5. Let X BK ey Y bea coproduct diagram in a 
topos. Then the maps 7 and j are monomorphisms, and the square 


0 —--——_—- Y 


| | 


is a pullback (so that X and Y are disjoint subobjects of X + Y ). 
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Proof: The square is obviously a pushout. Since the maps 0 > X 
and 0 — Y are monomorphisms by Corollary 7.5, the result follows from 
the preceding corollary. 


Corollary 6. For any family of monomorphisms m;: S; + B; for 
i € I ina topos, their sum (coproduct) m: || S; > |] Bi, when it exists, 
is again mono. 


Proof: By Corollary 5 and the associativity of the (infinite) coprod- 
uct, each of the coproduct inclusions v;: B; + []B; is mono. Thus, we 
may consider S; as a subobject of B = |] B; by composing the two maps 


jm: Sj-—> By B. 


Moreover, coproducts in a topos are disjoint, by Corollary 5, so for 
distinct indices 7 and j the subobjects B; and B, of B are disjoint. A 
fortiori, the smaller subobjects S; and S; are then also disjoint. By 
Proposition 7.6 [bis] their sum |] S; + B = |] B; is mono, as claimed. 


Exercises 


1. For a category €, let M(E€) be the category with objects the mon- 
ics of € and arrows the pullback squares of such monics. Prove 
that a subobject classifier for € is the same thing as a terminal 
object in M(E). 

2. If F: C > Dis left adjoint to G: D — C, while both C and D are 
cartesian closed, show that the unit and counit of the adjunction 
yield a canonical map F(C x GD) > F(C) x D, while the fact 
that G preserves products yields by evaluation a canonical map 
G(D*) x G(E) — G(D). Use the Yoneda lemma to prove that 
the first canonical map is iso iff the second is. 

3. Give a detailed proof of Theorem 4.2 (Beck’s theorem). 

4. Prove that the two definitions of an internal Heyting algebra, as 
given in §8, are indeed equivalent. 

5. In any cartesian closed category C, prove that the internal com- 
position C? x B4 — C4 defined in §2 is associative. Also show 
that for a given object C, C© is a monoid object in C with a 
two-sided unit e: 1 + C© which is the transpose of the identity 
C — C. If, in addition, C has pullbacks, construct the object 
Aut(C) of automorphisms of a given object C of C, and prove 
that Aut(C) is a group object in C. 

6. Let E€ be a topos and f: A— Ba map in €. 


(a) Prove that f: A — B is epi iff Pf: PB — PA is mono. 
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(b) For a map u: X — A, the graph G,, of u is by definition 
the subobject G,, € Sub(X x A) represented by the mono 
(1,u): X + X x A. Check that if v: X — A is another 
map such that G, = G, as subobjects of X x A, then 
u =v. Also check that 51, ¢: Sub(X x A) — Sub(X x B) 
sends Gy to G fou: 

(c) Using (b), prove that f: A > Bis mono iff dy: PA + PB 
is mono. 


7. Prove that for a topos €, the power object Pf of an object f: Y > 
X of €/X can be constructed as the following pullback 


Pf a Se 


| f 


X x PY Tyg P(X) x P(X) 7 P(X). 


8. Check the naturality of A as asserted in (4) and (5) of §6. 


9. (a) For a given map f: A — B, state and prove an internal 
version (involving 3;: PA + PB and Pf: PB — PA) of 
the projection formula (2) of §9. 
(b) For a given pullback square of the form (3) of §9, state and 
prove an internal version of the Beck-~Chevalley condition 
(13) of §9. 


10. If S and T are subobjects of some object E of a topos €, show 
that the square of inclusion maps 


SN T—\—> § 


| | 


T——— SUT 


is both a pushout and a pullback. (Hint: Use Theorem 7.8.) 

11. In a topos, a subobject S:— L x L is a “binary relation object” 
on L. From it construct for each object X a subobject Ry of 
Hom(X, L x L) which is functorial in X and such that t: X > 
Lx Lis in Rx iff Im(¢) C S. Investigate the converse. (Can one 
reconstruct S from a functorial Rx?) 

12. (a) Let C be a locally small category with all finite products. 
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If L is a lattice object in C, show that for each object X 
of C the hom-set Hom(X, L) is a lattice in Sets, and that 
Hom(-—, LZ) is a contravariant functor to the category of 
all small lattices. 

If L is an object of C such that each Hom(X, L) is a lattice 
in such a way that Hom( —, L) is acontravariant functor to 
the category of all small lattices, show that L has a (suit- 
ably) unique corresponding structure as a lattice object in 


13. Generalize the previous exercise from lattices to universal alge- 


bras. 


14. Let E be a topos, and A an object of €. An equivalence rela- 
tion on A is a monomorphism (p,q): R — A x A such that R is 
reflexive, symmetric, and transitive (expressed by the appropri- 
ate diagrams). One also says that the pair p, g: RA “is” an 
equivalence relation. 


(a) Let f: A — B be a map in €. Prove that the kernel pair 


(e 


— 


~— 


— 


Axp ASA of f is an equivalence relation on A. (The 
purpose of the rest of this exercise is to show the converse; 
see also the Appendix, §4.) 

Prove that if p, g: R= A is the kernel pair of some map 
A-— B, then it is also the kernel pair of the coequalizer of 
p and q. 

Let R be an equivalence relation on A. Show, using sym- 
metry and transitivity, that (p x 1)~!(R) = (qx 1)7}(R); 
ie., that the pullbacks of (p,q): RA x A along p x 1, 
respectively along g x 1: R x A— Ax A, are isomorphic 
as subobjects of R x A. 

Let 7: A > 4 be the transpose of the characteristic map 
r: Ax A+ Qof R-A-x A. Deduce from (c) that rp = 
7qa: RA 0A. 

Prove that the square 


R22 A 


‘| [F 


At SF: 


which commutes by (d), is a pullback (use the symmetry 
of R). 

Wrap up by the statement that every equivalence relation 
is the kernel pair of its coequalizer. 
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15. Let E be a topos. Recall that an object P of E is projective if 
the functor E(P, —): € — Sets preserves epis; that is, if for any 
epie: Y — X in € and any f: P — X, there exists ag: P— Y 
with eg = f. A category € is said to have enough projectives if 
for any object X of € there is an epi P + X with P projective. 


(a) Show that an object P of € is projective iff every epi 


q: X > P has a section (ie., an s: P + X with gs = 1). 


(b) Show that a retract of a projective object is projective, and 


(c 


) 


— 


that the coproduct (if it exists) of a collection of projective 
objects is again projective. 

Show that in case is a presheaf topos (€ = Sets©”” for 
some small category C), then an object P of E is projec- 
tive iff P is a retract of a sum of representables. (Hence 
presheaf topoi have enough projectives.) 

Show that if B is a complete Boolean algebra, then every 
object of the topos Sh(B) of sheaves on B is projective. 
Let X be a 7Tj-space (points are closed). Show that if 
the terminal object 1 is projective in the topos Sh(X) of 
sheaves on X, then X has a basis consisting of clopen 
sets (here clopen means closed and open). Conclude that 
if Sh(X) has enough projectives, then X has a basis of 
clopen sets. (What about the converse?) 

Show that 1 is projective in Sh(X) iff every open cover of 
X has a refinement by pairwise disjoint open sets. Show 
that 1 is projective in the topos Sh(NY) of sheaves on the 
Baire space NN, 


16. Let € be a topos. An object P of € is called internally projective 
if (—)?: € — € preserves epis. 


(a) Show that the following three conditions on an object P 


are equivalent: 
(i) P is internally projective; 
(ii) The right adjoint Ip: €/P — E preserves epimor- 
phisms; 
(iii) for any T in €, any epi Y » X in € and any map 
T x P — X, there exists an epi e: T’»T anda 
commuting square 


T’ x P——— Y 
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Prove that for any morphism f: A — B in €, the change- 
of-base functor f*: E/B — &/A sends internally projec- 
tive objects of €/A to internally projective objects of €/B. 
Also prove that in case f is epi, an object P — A is inter- 
nally projective in €/A whenever its image f*(P — A) is 
internally projective in €/B. 

In this part, assume that the terminal object 1 € € is 
projective. Show that an internally projective object of E€ 
is projective. Is the converse also true? Show that every 
object of € is projective iff every object of € is internally 
projective. 

Give an example of a topos for which every object is in- 
ternally projective, but not every object is projective. 

Let X be aT, topological space. Show that if every object 
of Sh(X) is internally projective, then X is discrete. [Hint: 
for a given point x of X, consider the projection of the 
constant sheaf A(Z/2) to the skyscraper sheaf Sky, (Z/2) 
with stalk Z/2 at x and stalk {0} elsewhere; see §IT.6.] 


V 
Basic Constructions of Topoi 


In this chapter we present some basic ways to construct new topoi 
from old ones. 

First we consider the construction of sheaves for a topology, gener- 
alizing the results of Chapter III. More specifically, for an elementary 
topos €, we introduce the notion of a Lawvere-Tierney topology j on €. 
This will include “point-set” topologies and the notion of a Grothendieck 
topology. Indeed, if € is the topos Sets©”” of presheaves on a given 
small category C, then a Lawvere-Tierney topology on the topos € cor- 
responds to a unique Grothendieck topology on the category C. We 
then proceed to define the category Sh, € of sheaves for such a Lawvere- 
Tierney topology j, show that Sh; € is a topos, and prove the existence 
of an associated sheaf functor. Such associated sheaf functors were con- 
structed earlier via bundles (Chapter II) and via the plus-construction 
(Chapter III). The method to be developed here is different from these 
two. 

The second construction concerns comonads. A comonad on a cate- 
gory € is given by a functor G: € — € and two natural transformations 
6: G > Ide and e: G > G?, satisfying the identities dual to those for 
a monad [cf. IV.4(1)]. Dual to the notion of algebra for a monad one 
has the notion of coalgebra for a comonad. We prove that if G is a 
comonad on a topos € such that G: € — € preserves finite limits, then 
the category of coalgebras for G is again a topos. 

As special cases of this last result, one obtains generalizations of the 
familiar constructions of the topos BG of G-sets for a group G and of 
the topos Sets©’” of presheaves on a small category C. Indeed, one can 
replace the category Sets by an arbitrary elementary topos €, and show 
that for a group object G in €, the objects of € equipped with an action 
by G form a topos; and similarly, if C is a “category object” in €, then 
the category of objects of € equipped with the structure of a presheaf 
on C (suitably defined) form a topos. 

Finally, we consider colimits. One can show that the colimit of a 
filtered diagram of topoi and logical morphisms between them is again a 
topos. In other words, the category of topoi and logical morphisms has 
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filtered colimits. We will treat just a special case, the filter-quotient con- 
struction, which generalizes the construction of ultraproducts familiar 
in model theory. 


1. Lawvere—Tierney Topologies 


To define sheaves on a topological space X, what “matters” is “what 
gets covered”. Thus, if C' is any collection of open sets U; of X fori € J, 
what matters is the collection, call it 7(C), of all those open sets covered 
by the U;. Then j of everything is everything and j(j(C)) = j(C), 
while for intersections one has 7(C; MN C2) € j(C1) Nj(C2)—and this 
inclusion is an equality in case both C and C2 are sieves. Moreover, 
if the collection C’ is actually a sieve S on some open set U of X, then 
j(C) is also a sieve and C' = S is exactly an element S € 1(U), where 
Q is the subobject classifier for the topos of all presheaves on X. Thus, 
j becomes a map 7:2 — 2. These observations may motivate the 
following general definition of a “topology” on a topos. 

Let E be a topos, and let 2 be its subobject-classifier. A Lawvere- 
Tierney topology (or briefly a topology) on € is amap 7:2 > Nin € 
with the following three properties 


(a) jotrue=true, (b) joj=J; (c) JoAN=Ao(j x J); 
{ttre 5G) Q— 9 axa. m 
SOON od 
co a Q x9 ——0. 


Since the subobject classifier can be interpreted as a collection of truth 
values, one may regard the morphism j as a kind of modal operator on 
these truth values. Of course, 7: 2 — | determines and is determined 
by the subobject J» which it classifies, as in the pullback diagram 


J ——— 1 


- re 


Seer aL 


and the definition (1) of a topology can also be phrased in terms of the 
subobject J. 

The intent of such an operator 7 may be illustrated in the case of the 
topos € = Sets? ” of presheaves on a topological space X. Recall 
(§1.4(4)] that the subobject classifier © of this functor category is the 
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functor sending each open set U into the set Q(U) of all sieves S on 
U, where a sieve S on U is a set of open subsets V of U such that 
W CV €S implies W € S. Recall also that each open subset V C U 
determines a principal sieve V, consisting of all open W C V, and that 
truey: 1 — (U) is the map that picks out the maximal sieve U on U. 
Now define J by 

J(U) ={S | S is a sieve on U and S covers U }, (3) 
where “S covers U” means that U = {V|V €S}. Consider W CU. 
Since the intersection SMW, defined by SNW ={VNW|V €«S}, 
covers W if S covers U, it follows that J is a subfunctor of 2. The 
corresponding classifying function 7: 2 — CQ is then given, for any sieve 
S on an open set U, by 

ju(S) = { W | W is open in U and SN W covers W }; 


in other words, jy(S) is the principal sieve V, where V = U{w|We 
S}. Thus jy(S) specifies exactly the largest subset V of U covered by 
the sieve S. Let us verify that the map 7: 2 — | thus defined is a 
Lawvere-Tierney topology. The second property of the definition (1) is 
obvious, and the first is just ju(O )= U. As for the third, consider 
two sieves S and T on U. Then W € ju(S)N ju(T) means that W is 
both a union W = UL, V; of sets V; € S and a union W = UV, of sets 
Vj; €T; this implies that W = U(ViN V7), so W € ju(S NT). Thus, 
ju(S) Nju(L) € ju(S NT). The reverse inclusion also clearly holds, so 
that ju(S)N ju(T) = ju(S NT), as required for (1)(c). 

The operator 7 serves to specify what each sieve covers. We recall 
that sheaves on a topological space can be defined in terms of coverings 
by sieves (II.2.2). It will turn out that sheaves can be similarly defined 
within any topos with a Lawvere-Tierney topology. 

The operator 7 also determines a unary operator A+ A, called “clo- 
sure”, on the subobjects A — E of each object E, by the correspondence 


Hom(E, 2) —=— Sub(E) 3 A 
Hom.) | | (4) 


Hom(E, 2) —-=— Sub(E) 3A. 
In other words, given A € Sub(E), its j-closure A is that subobject of 
E with characteristic function j(char A), as in the diagram 


A——— 1 true (5) 
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with both squares pullbacks from true. In other words, 
char(A) = j char(A). (5’) 


This operation is “natural” in FE, in the usual sense that for any map 
f: E— F in € and any subobject B of F one has (since f—' is defined 
by pullback) 


f-*(B) = (f7"B). (6) 


Proposition 1. For any topos €, an arrow 7: Q — Q determines 
by (5’) an operator on the subobjects of each object E of E 


Aw A,  Sub(E) — Sub(E), (7) 


which is natural in E € €. Moreover, j is a Lawvere-Tierney topology 
if and only if this operator has, for all A, B € Sub(£), the properties 


AcA, A=A, ANB=ANB; (8) 


we then say that A+ A is a closure operator. Conversely, an operator 
defined on all Sub(£) that is natural in E and has the properties (8) 
always arises in this way from a unique Lawvere-Tierney topology j. 


Proof: This is an example of the transfer of an algebraic structure 
(given by j and A) on an object 2 in E to the corresponding functorial 
structure on the hom-sets Hom(E£, 1) = Sub(£). The last two identities 
of (8) are the immediate translations of the last two identities (1) on 
a topology. As for the first identity, j(true) = true, its translation 
uses the fact that any subobject is a pullback of true: 1 — 2. Thus, 
j(true) = true implies in (5) above that the right-hand trapezoid, with 
the dotted top, is commutative. Since the outer rectangle is a pullback, 
this gives a map A — A and hence A C A, as in the first of (8). 
Conversely, this property A C A applied to the maximal subobject 1 —> 1 
of 1, with characteristic function true: 1 > 1, gives j(true) = true, as 
required. 

The inclusion A c A may also be written as an equation AN A = A. 


Therefore, a Lawvere-Tierney topology j on E can be described by the 
three properties of (1) with the first replaced by the equation 


1=A(x f)A:9450x2-*,0x0 AQ, (9) 
where A is the diagonal map for 2. 


We call A the closure of A> E, and we say that A is dense in E 
when A = E, and that it is closed when A = A. 
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The reader should be warned that the closure operators of the kind 
defined here have nothing to do with closed subsets in a topological 
space. Indeed, in the example given above of a Lawvere-Tierney topol- 
ogy on the topos of presheaves on a topological space X, closed subsets 
of X play no role whatsoever. Moreover, taking the closure of a subset 
of a topological space is an operation which commutes with unions, not 
with intersections as in (8). 

These topologies j include the Grothendieck topologies: 


Theorem 2. Every Grothendieck topology J on a small category C 
determines a Lawvere-Tierney topology j on the presheaf topos Sets©”’. 


Proof: Recall that the subobject classifier 0 for the presheaf topos 
is the functor 2(C) = { S | S is a sieve on C’}, and that a Grothendieck 
topology J assigns to each object C' of C aset J(C) of “covering” sieves 
with specified properties. So given a Grothendieck J, define 7: 2 — 2 
in terms of the description (§III.2) of when a sieve S covers an arrow g 
by 

jc(S) = {g|S covers g: DC} 


= {g| 9*(S) € J(dom g) }. 


Then for any f: C’ — C this definition gives jo: (f*S) = f*jc¢(S) for any 
sieve S on C, so 7 is indeed a natural transformation 7: 2 > Q. Since the 
maximal sieve tc (all arrows to C) always covers, we have jco(tc) = tc, 
so 7 otrue = true. For sieves S and T on C, S C T clearly implies 
jc(S) C je(T); hence, for any S and T, jo(SNT) € je(S)Njc(T). 
The converse inclusion also holds, by III.2(iva). Hence the jc defined 
by (10) satisfies (1)(c). Finally, jo(S) © jejo(S) for any sieve S on C, 
since jg is order-preserving and S C jo(S). Conversely, if g € jejc(S), 
then j¢(S) covers g. But for each h € j¢(S) one has that S covers h. 
Hence, by the transitivity property of a Grothendieck topology, S covers 
g, i.e., g € jo(S). Thus j satisfies (1)(b). 


(10) 


This proves Theorem 2. It follows by Proposition 1 that any 
Grothendieck topology J also gives a natural closure operator (which 
is explicitly described in Exercise 3). 

Conversely, we will show in §4 that every Lawvere-Tierney topology 
4 on a presheaf topos arises in this way from a Grothendieck topology. 
But the fascinating fact remains that there are other Lawvere-Tierney 
topologies on other topoi. For example, every topos € has a negation 
operator =: 2 — 2. The properties of = developed in §IV.8 show that 
double negation —7=: 2 — © is actually a Lawvere-Tierney topology. 
This will be applied to the independence of the Continuum Hypothesis 
in Chapter VI. 
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2. Sheaves 


The definition (and the properties) of sheaves on a topological space 
X can now be extended to define a sheaf for a Lawvere-Tierney topology. 
To do this recall that both a presheaf P on X and a sieve S on an open 
subset U of X are functors O(X)°? — Sets, ie., objects of the topos 
Sets°(™”? In particular, S$ is a subfunctor of the representable functor 
y(U) = Homex)(—,U) via the inclusion 


i: Sy(U). (1) 
Proposition II.2.1 shows that a presheaf P is a sheaf iff the induced map 
i*: Hom(y(U), P) — Hom(S, P) 


is an isomorphism for every covering sieve on U; that is, for every sieve S 
such that the corresponding subobject S of (1) is dense. This definition 
of a sheaf, like that in Chapter III, §4, will translate to an arbitrary 
Lawvere-Tierney topology. 

Let 7 be a Lawvere-Tierney topology on a topos €, with a corre- 
sponding closure operator () as in Theorem 1.1. Recall that for any 
object E of €, a subobject A of E is called “dense” if A = E. One also 
says that A» E is a dense monomorphism. 


Definition. An object F of € is called a sheaf (for the Lawvere- 
Tierney topology j, or a j-sheaf), if for every dense monomorphism 
m: A> E in €, composition with m induces an isomorphism 


m*: Home(E, F) —+ Home (A, F). (2) 


In other words, a map from a dense subobject of E into a sheaf can 
be uniquely extended to a map on all of E: 


A—— F (a sheaf) 
son 4 (3) 
i 


E. 


More generally, one also defines an object G to be separated if for 
each dense A E, 


Home(E, G) > Home (A, G) (4) 


is a monomorphism; i.e., the extension as in (3) with F' replaced by G 
need not exist, but if it exists, it is unique. 

We write Sh, € (or sometimes €;) and Sep, € for the full subcate- 
gories of € given by the sheaves and by the separated objects, respec- 
tively. (In §4 we will show that the definition of a sheaf as just given 
generalizes the notion of a sheaf for a Grothendieck topology.) 
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Lemma 1. Both subcategories Sh, € and Sep;€ of € are closed 
under all finite limits, as well as under exponentiation with an arbitrary 
object from E. 


Proof: The terminal object 1 of € is clearly a sheaf, hence is also 
separated; it is thus the terminal object in both Sh; € and Sep, €. If 
G3 H are two arrows in € between objects G and H which are separated 
(or are sheaves), we claim that their equalizer C’ in € is also separated 
(or a sheaf, respectively). For given any dense m: A» EF we may form 
the commutative diagram 


Hom(£, C') ———> Hom(E, G) ———3 Hom(E, H) 


m| | ly 


Hom(A, C’) ————> Hom(A, G) "3 Hom(A, H); 


since C' is an equalizer in €, both rows are equalizer diagrams in Sets. 
The assumptions for G and H mean that the two right-hand vertical 
maps m* are injections or bijections, respectively. A simple diagram 
chase then shows that the left-hand vertical map m* is an injection 
or bijection, respectively, as required. A similar argument shows that 
the product in € of two separated objects (or of two sheaves) is itself 
separated (or a sheaf). Therefore, the categories Sh; € and Sep, € above 
both have all finite limits (and the inclusions in € preserve limits), A 
similar argument shows that the inclusions preserve any limit present 
in €. 

As for exponents, if G is separated or a sheaf, then so is the exponent 
G* for any object B of €, in virtue of the commutative diagram 


Hom(E, G?) —"™—-+ Hom(A, G?) 


: - 


Hom(E x B,G) Tixmr? Hom(A x B,G). 


The argument for this conclusion uses the fact that m dense implies 
1xm: Ax B— E x B dense, because closure is natural under the 
projection : E x B — E, which has 7—'(A) = A x B, as in (1.6). 
Hence, the exponent G¥ given in € serves also as an exponent in the 
subcategories Sep; € and Sh; €. 

The arrows j, 1: 2 > 12 in E have an equalizer 0, 


1, 2+ 90. (7) 


Since j is idempotent, by the condition (1)(b) of §1, the universal prop- 
erty of this equalizer shows that there is a unique map r: 2 > Q; with 
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mr = j. Therefore, mrm = jm = m; since m is mono, this gives 
rm = 1. In other words, 2, is a retract of 9 and 


(0,0 (8) 


is the epi-mono factorization of j—so that 0; is the image of j. Since 
Q is injective (Proposition IV.10.1) and since a retract of an injective 
object is injective, it follows that 0; is injective. Since 7 o true = true, 
the latter factors through 2, as true,: 1 > ;. 

If a subobject A of E is characterized by a: E — 2, then by (1.b) its 
closure A is characterized by j oa. Hence A is closed in E iff j oa =a, 
or, iff a factors through 2, —Q. In other words: 


Lemma 2. The object 0, classifies closed subobjects, in the sense 
that, for each object E of €, there is a bijection (natural in E) 


Home (E, 2;) —~—> ClSub¢(E); (9) 


here ClSubg(£) is the lattice of closed subobjects of E. 


In the special case of the topos € of presheaves of sets on an ordinary 
topological space X, the subobject classifier Q of E had for Q(U) the 
set of all sieves S on the open set U of X, while the operator 7 for the 
topology sends each sieve S into the principal sieve given by the union of 
all the open sets W with W ¢€ S. In this case the image 1.;(U) of j thus 
consists exactly of all principal sieves on U; in other words, of all open 
subsets of U. In our treatment of sheaves on a space, we already saw in 
I1.8(3) that this functor 2; was indeed a sheaf, and was, moreover, the 
subobject classifier for the category of sheaves on X. This will also be 
the case for any j. 

Similarly, we wish to show for any Lawvere-Tierney topology j that 
Q; with true,: 1 — Q, is a subobject classifier for Sh; €. This will 
involve two things; namely, a proof that 0, is a sheaf and, for a subobject 
A of a sheaf EF, a proof that A is closed iff A is a sheaf. This first fact, 
that 0, is a sheaf, is an immediate consequence of (9), the definition of 
a sheaf, and the next lemma. 


Lemma 3. Let m: A> E be dense. Then the inverse image map 
m-*: Cl Sube(E) — ClSub¢(A) 


is an isomorphism. 


Proof: Define a proposed inverse m,: ClSube(A) > ClSub¢(E) 
to m~} by setting miU = Im(UV) for any closed subobject U of A; in 
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other words m,U is to be the closure of the image of U under m. Then 
for U — A closed in A, 


m-*m(U) = m7*(Am(U)) = m-13,,(U) =U, 


for m~'3, = 1 because m is mono, cf. Corollary IV.3.3. Since U is 
closed, this gives m—!m,(U) = U. For the opposite composite, with 
V > E closed in E, mym~*(V) = Imm—1(V). But m7'V is VM A, so 


mum '(V)=VNA=VNA=YV, 
the last equality because V is closed in F and A is dense there. Thus, 
m, is a two-sided inverse for m7!, q.e.d. 


Lemma 4. If m: A — E is a subobject of a sheaf E, then A is 
closed in E iff A is also a sheaf. 


Proof: (<) If A is a sheaf, consider the solid arrows in the following 
diagram: 


Since d is dense, the map 1 can be extended over all of A to give a 
unique r: A > A with rd = 1. Since E is a sheaf and d is dense, 
mrd = m = md implies by uniqueness that mr = m. Therefore, r is 
mono, both triangles commute, and r must be an isomorphism. 

(=) Suppose A is closed in E, and consider an arbitrary dense in- 
clusion d: D-— B and a map f: D — A. Since E is a sheaf, there is a 


unique g: B — E with gd = mf: 


f 


D———. 


Be: 


Therefore, since closure is natural, B = D < g-1(A) = g"\(A) = 
g7\(A), ie., g factors through m. This gives a unique h with hd = f, 
which shows that A is a sheaf. 
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Theorem 5. Let € be a topos, and j a Lawvere-Tierney topology 
on €. Then Sh, € is again a topos, and the inclusion functor Sh; € —> € 
is left exact and preserves exponentials. 


Proof: Since € has all finite limits and exponentials, so does Sh, € 
by Lemma 1, and these operations are preserved by the inclusion 
Sh; € >> €. Moreover the object 2, of € is a sheaf by Lemmas 2 and 3, 
and it classifies subobjects in Sh; € by Lemmas 2 and 4. 


3. The Associated Sheaf Functor 


The purpose of this section is to prove the following result. 


Theorem 1. Let j be a Lawvere-Tierney topology on a topos E. 
The inclusion functor has a left adjoint a: E — Sh, €; moreover, this 
functor a is left exact. 


The functor a of Theorem 1 is called the associated sheaf functor, or 
the sheafification functor, its value a(£) at a particular object E of E is 
called the sheaf associated to E, or the sheafification of EF. 

We have already seen several instances of Theorem 1. In Chapter II, 
we constructed for each topological space X a left adjoint to the inclusion 
Sh(X) > Sets?” (cf. I1.5), using étale spaces over X. And in §IIL5, 
we have constructed a left adjoint to the inclusion Sh(C, J) + Sets” 
for an arbitrary site (C, J), by means of Grothendieck’s “++ construc- 
tion”. To prove Theorem 1, however, we shall use a quite different 
method. 

As a preparation for the proof, we need to take a closer look at sepa- 
rated objects. In the statement of the following lemmas, recall that the 
singleton map {-}¢: C' — PC is the map corresponding to the subobject 
A = Ag € Sub(C x C) represented by the diagonal. If f: A > C is 
a map in €, the graph G(f) is the subobject of A x C’ represented by 
the mono (1, f): A > A x C. We assume chosen a fixed topos €, and a 
Lawvere-Tierney topology j on €. A first evident result is 


Lemma 2. Let BC’ be monic. If C is separated, then so is B. 


Lemma 3. For any object C' of €, the following are equivalent: 


(i) C is separated; 
(ii) the diagonal Ac € Sub(C x C) is a closed subobject of C x C; 
(iii) 72 of-}e= {-}c, as in the commutative diagram 
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Qe. 
(iv) for any f: A— C, the graph of f is a closed subobject of Ax C. 


Proof: (i) > (ii) Assume C' separated. Consider A> AC xC. 
Since C’ is separated and since the projections 7; and m2: Cx C —C 
coincide on A, they coincide on A. But AC x C is the equalizer of 
m and 7, so A < A. Thus, A is closed, as required in (ii). 

(ii) + (iii) The diagonal A is closed in C x C iff its characteristic 
map 6g: C x C — 2 satisfies 7 o 6g = 6¢. By taking the exponential 
transpose C’ > © of both 6¢ and j 0 dc, this is equivalent to the 
requirement that j° 0 {-}o = {-}e. 

(ii) > (iv) The graph G(f) of a morphism f: A — C is a pullback 
of A, as in 


AO COKE 


[ | 


G(f) ——— 4, 


so G(f) is closed if A is, because by (1.6) pullbacks commute with 
closures. 

(iv) > (i) Let m: A> B be a dense inclusion, and let f, g: B 3 C 
be maps with fm = gm. Consider for f the pullback diagram 


AX OR BeOS ORC 


ee 


G(fm)—— G( f) ————— A. 


Here A x C > B x C is dense, since it is the pullback of A — B along 
the projection B x C > B. Viewing G( fm) as a subobject of B x C, 
we see that G(fm) is therefore dense in G(f), so G(fm) = G(f) since 
G(f) is closed. Thus fm = gm implies that G(f) = G(g). But any 
map can always be recovered from its graph, so f = g and therefore C 
is separated. This completes the proof of Lemma 3. 
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For any object E of € we next construct a certain epimorphism 
On: E> E' (1) 


into a separated object E’. To do this recall [from 2.8] that there is 
a retraction r: 2 — Q; and take E’ to be the image of the composite 
re o {}e , as in 


E'»———» OF, 


where {-}r is the singleton map. Since 2 is both a sheaf and injective, 
so is the exponential 2,”. Its subobject E’ is therefore separated, by 
Lemma 2. Now j has the epi-mono factorization j = mr of (2.8). If E 
is already separated, Lemma 3, part(iii) shows that j¥ o {-}e = {-}5; 
then this map as well as r” o{-}, is already mono, so E’ = E. Because 
any subobject of a sheaf is necessarily separated, this proves 


Proposition 4. An object E of a topos E is separated iff it can be 
embedded in an injective sheaf (here 0;”). 


The kernel pair of an arrow t: E — W is usually defined as a univer- 
sal pair of arrows f, g: B — E with tf = tg; here we regard it instead 
as a monic (f,g): B — E x E. In other words, the kernel pair of t is 
the pullback of the diagonal AW along ft x ¢, as in 


B—————> AW 


kernel par p.b. | 


Ex E——— W x W. 


txt 


Now the kernel pair of a monic t is just the diagonal A: E - Ex E,a 
subobject of E x E. The following lemma states that the map 07: E > 
E’ of (1) above is as close to being a monic as it possibly can be. 


Lemma 5. For any object E of € there is an epimorphism 6, to a 
separated object E" such that the kernel pair of 0g: E — E’ is precisely 
the closure A of the diagonal A C E x E. 


Proof: By the diagram (2) with E’ an image, the kernel pair of 0¢ 
is the same as that of r¥ of{-}e: E> 03". To see that this kernel pair 
is contained in A, consider two maps f, g: B > E with r® of{-}zof = 
r£ o{-},0g. Transposing the first and composing with m to get 7 = mr 
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gives the bottom row in the following pullback diagram, where G(f) is 
the graph of f: 
f 


B———> E ————+ 1 


anatn| [2 | 


Bx E——— Ex E———- 9 ——— 2. 
fxi1 6 3 


By the definition of the closure (multiply the characteristic function by 
j) the bottom row is the characteristic function of the closure G(f). 
The given equality thus means that G(f) = G(g). Since the graph (1, f) 
factors through G(f), it must then factor through G(g) as in the triangle 
below. But G(g) is the pullback of A along g x 1, so the square below 
commutes and (g, f) = (g x 1)(1, f) must factor through A, as in the 
diagram 


BO? ,pxE— .ExE 


*, = | = | 
G(f) = G(g) ——4. 


This proves that the kernel pair of E » E’ is contained in A. 
For the converse, it suffices to show that 


Ke). oF “10,7 (3) 


commutes. But oF is a sheaf, and clearly the two maps in (3) agree 
on the dense subobject A of A. Therefore (3) commutes. This proves 
Lemma 5. 

Since, in a topos, epimorphisms are coequalizers of their kernel pairs, 
as stated in Theorem IV.7.8, it follows that 


A= EE’ (4) 
is a coequalizer. 


Corollary 6. Any map 0g: E— E’ as in Lemma 5 is universal for 
maps from E into separated objects. (And therefore E > E' defines a 
left adjoint to the inclusion Sep, € — €.) 


Proof: Let f: E — S be a map from E into a separated object S. 


f =. 
Then m™, 72: A= E — S commutes, hence so does 71, 72: ARES. 
By the coequalizer (4), f thus factors uniquely through EF — E’. 


Corollary 7. The inclusion functor Sh, € — Sep, € has a left ad- 
Joint. 
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Proof: Let E € € be separated. By Proposition 4 above, there is a 
monomorphism m: E— F into a sheaf F. Let E be its closure. Then 
E is a sheaf by Lemma 2.4. Moreover, E — E is dense, hence by 2.(3) 
is universal among maps from EF into sheaves. 


Combining the preceding two corollaries, we have proved the first 
part of Theorem 1, namely, that Sh, € — € has a left adjoint, given in 
two steps as follows. Take any map 0g: E — E’ as in Lemma 5 and 
embed £’ in an injective sheaf Iz; the composite E — Ig then has 
minimal kernel pair. Indeed, any map ig: E — Ig with minimal kernel 
pair arises in this way from some such 6g with E’ = im(iz). Hence the 
following rule for constructing the adjoint: 

Given EF, take any ig: E — Ig with Ip an injective sheaf and with 
minimal kernel pair A Cc E x E. Form the image ip(£) and its closure 
ip(£) in the lattice of subobjects of Ig. This closure is the associated 
sheaf a(Z) and the composite E — a(£) is the unit of the adjunction, 
all as in 


AE — E ——2—__ 5 Ip 


I<] 
ig(E)———— ig (E) = a(E£). 


For example, let us compute a(£) where u: E — A is a subobject 
of A and a(A) has been computed from some i4: A — I4. This means 
that AA is the kernel pair of i4, so that the right-hand rectangle below 
is a pullback. But (u x u)~1AA = AE, since u is mono and closure is 
natural in A (Proposition 1.1), so the left-hand square is also a pullback 


AE ————> AA ———— AI, 


re 


te earner es mere © x Ih. 

Therefore the whole rectangle is a pullback, which means that AF is the 
kernel pair of i4 ou, so that a(£) may be computed from I, via i, ou. 
It follows that the associated sheaf functor preserves monos. 

We shall now prove that this sheafification functor a preserves all 
finite limits: 

(a) a preserves the terminal object, because the terminal object is a 
sheaf already. 

(b) Let E and F be objects of €, and let Iz, Ip be as above. Then 
Ig xIp can be used as Ix F in the construction of a(E x F); since closure 
commutes with products, it is then clear that a(E) x a(F) & a(E x F); 
so a preserves products (as usual, with their projections). 
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(c) Finally, to show that a preserves equalizers, consider a diagram 
E—*>A = B, 


which is an equalizer in €. Let a(A) and a(B) be computed by I, and 
Tg as above. It follows that f and g can be extended to f’, 9’: Jas Ip 
[first to maps f and g on i4(A) since A > i,(A) is the universal map to 
the associated sheaf, then to I4 by the injectivity of Ig], as displayed in 
the two squares on the right of the diagram below. To compute a(£), 
we can by (5) take the embedding EA — I4, ie., Ig = I, and 
construct the following diagram from the given equalizer w: 


By—__*__> A——— 
ied - f 
in=iaoul] i 4u(E)——> ig(A) JF ts B(B (6) 
Ia 5555 4b kn. 
9g 


Let z: X — i,(A) be the equalizer of f and 9. It is enough to show 
that X C i4u(E) as subobjects of I, [or of i4(A)]. Construct the two 
pullbacks D and then Y displayed on the left below, while F an equalizer 
of f and g makes the right-hand square a pullback: 


Y-—_4— A 


ee ee 


— A——~ BxB 
D»—2— i4(A) NA 5 a = m7 
E ——— Ag. 


a, 
| 
X>—=— i4(A) 


Then, by the choice of 2, nay equalizes f and g. Therefore by (6), 
(fy, gy) factors through the kernel pair of B > Ig, which is Ag. So 
y factors through E — A as indicated on the right of the preceding di- 
agrams. Thus, nay(Y) < na(E) < na(E) = iau(E); or equivalently, 
since Y — D as a pullback is epi, z(D) < iau(E). But then since D is 
dense in X, also X < iau(E), as was to be shown. 
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This proves that a: € — Sh; € is left exact, and so completes the 
proof of Theorem 1. 

For later use in §VIII.8, we state the following simple consequence 
of Theorem 1. Informally, it says that to describe the subobject lattice 
Subgp, ¢(a£), it is not necessary to compute the sheafification aE of an 
object E of E. 


Corollary 8. Let E be an object of €. The associated sheaf functor 
induces an isomorphism 


Cl Sube(£) ———- Subs, ¢(ak£) 


between closed subobjects of E and subsheaves of aE. 


Proof: Since 0; is the subobject classifier of Sh, € and a is left 
adjoint to the inclusion, we have 


Subsn, e(ak) = Homsh, e(ab,0;) = Home(E£, Q;). 


The corollary then follows by Lemma, 2.2 (on objects classified by 2;). 


4, Lawvere—Tierney Subsumes Grothendieck 


This section will develop its title by showing that the definition of 
a sheaf for a Grothendieck topology J is actually a special case of the 
definition of a “sheaf” for a Lawvere-Tierney topology 7. This means 
in particular that the sheafification process just constructed for such a 
4 gives an alternative construction of sheafification for a Grothendieck 
topology. First we establish a converse to Theorem 1.2, asserting that 
every J is aj. 


Theorem 1. IfC is asmall category, the Grothendieck topologies J 
on C correspond exactly to Lawvere-Tierney topologies on the presheaf 
topos Sets” 


Proof: Recall that the subobject classifier 0 of Sets© is the func- 
tor with 0(C) the set of all sieves S on C’, while the action of a map 
f:C’ 3C on Q is given by 


FMC) AC), F*(S)={9| fg eS}. (1) 


Now any Lawvere-Tierney topology 7: 2 = 2 on SatsC” -clasines the 
subobject J 1 defined as in (1.2) by 


SEJI(C) iff Jo(S) = te, (2) 
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where tc is the maximal sieve on C’. Notice first that since 7 commutes 
with meets, as in §1.1(c), it must be order-preserving, so S € J(C’) and 
S CT imply that T € J(C). We claim that this J is a Grothendieck 
topology, ie., that J(C) is a set of “covering” sieves for such a topology. 
First, the property jo o trueg = trueg implies that tc € J(C): the 
maximal sieve covers. Next, since jc is natural, each map f: C’ — C 
in C gives jo: (f*S) = f*(jcS) for any sieve S on C; in particular, S € 
J(C) implies that f*(S) € J(C"), so this J satisfies the stability axiom 
for a Grothendieck topology. Finally to examine the transitivity axiom, 
consider a covering sieve S € J(C’) and another sieve T on C’ such that 
g*(T) € J(domg) for all g € S. We need to show that T € J(C); ie., 
that jc(T) = tc; for this it will be enough to show that jojo(T) = te. 
But for g € S with g: D > C one has g*(jcT) = jo(g*T) = tp, 
so lp € g*(jcT), which means that g € jcT for all g € S. Thus, 
S € jc(T) and, hence, te = je(S) € jcjc(T). 

Conversely, Theorem 1.2 shows that each Grothendieck J determines 
a Lawvere-Tierney 7. One may check that these constructions 7 1» J 
and J +> j are mutually inverse. This proves the theorem. 


Theorem 2. Let C be a small category and j a Lawvere-Tierney 
topology on Sets©” while J is the corresponding Grothendieck topol- 
ogy on C, as above. Then a presheaf P € Sets© is a sheaf for j, in 
the sense of §2, iff P is a J-sheaf, as defined in Chapter II. 


Proof: First consider a presheaf P which is a j-sheaf. Then for each 
object C' in C and each S € J(C) for the corresponding J one has 


Hom(S, P) © Hom(yC, P) & P(C), (3) 


the first isomorphism, because P is a sheaf and S € J(C) means by (2) 
that S — y(C) is dense, while the second isomorphism is by the Yoneda 
lemma. As is explained below §III.4(2), (3) expresses the fact that P is 
a J-sheaf in the sense of a Grothendieck topology. 

Conversely, suppose that P is a J-sheaf in the sense of Grothendieck, 
while A» E is some dense subfunctor of a presheaf EF on C. To show 
that P is a j-sheaf as defined in (2.2) we must extend any map co: A > P 
to a unique 7: E — P. Now the definition of the closure A and the 
construction of J from j in (2) shows for each e € E(C) that 


e€ A(C) iff j(char A)e = truec _ iff (char A)e € J(C). (4) 


But by the definition 1.4(7) of the characteristic function of A> E this 
holds iff the sieve 


(charc Ale= {f: D>C|le-f € A(D)} (5) 
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is in J(C); ie., is a covering of C. But the given o yields a matching 
family {op(e- f) | f: D 4C € (charg A)e} for this cover. Since P 
is a (Grothendieck) sheaf, this means that there is a unique element 
p € P(C) with p- f = op(e- f) for each f € (charg A)e. If one defines 
Tc (e) to be this element p one has the desired natural transformation 
tT: E > P extending o; this 7, like p, is unique. Hence the J-sheaf P is 
a j-sheaf. This completes the proof of Theorem 2. 


Note that (4) and (5) together state that the closure operator corre- 
sponding to the Grothendieck topology J is given by 


e€ A(C) iff C is covered by {f: D>Cle-feA(D)}, (6) 


for A C E, C € C and e € E(C) as above. This property will be used 
in §VIII.8 to discuss “open” geometric morphisms. 


5. Internal Versus External 


There are two ways of “working” in a topos €. On the one hand, 
one can consider € as a mathematical object which satisfies the axioms 
stated in IV.1, and then draw conclusions from the axioms. Since these 
axioms make assertions just about “all” objects and “all” arrows of € 
(i.e., statements in the first-order predicate calculus) they do not involve 
any set-theoretic assumptions; they can thus be viewed as (part of) an 
independent description of a category € as a universe of discourse. For 
a particular such topos € one may then wish to consider additional 
axioms; for instance, the requirement that the topos be Boolean (i.e., 
that its “natural” Heyting algebra object 2 be a Boolean algebra object). 
Properties of a topos or of several topoi formulated in this way and 
theorems so proved from the axioms are said to be “internal”. There 
is also a more restricted notion of “internal”, involving the so-called 
Mitchell-Bénabou language (Chapter VI). 

On the other hand, one can think of a topos € as an object formed 
within set theory, as a set of objects and a set of arrows for which 
composition (etc.) is defined so as to satisfy the category and topos 
axioms. On these sets one can then make all the familiar constructions 
of hom-sets, limits, adjoints, and the like. Developments in this style 
are called “external”; they fall under the familiar mathematical pattern 
of carrying out “all” mathematics within set theory. 

Some examples of the external/internal contrast are already at hand. 
The external hom-set Hom(A, B) has the exponential object B4 as its 
internal analog; for each there is a composition map 


Hom(B,C) x Hom(A, B)  Hom(A,C), (1) 
CB x BA _, CA, (2) 
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the latter defined in the expected way [IV.2.(5)] as the transpose of the 
composite of the two evaluation maps 


C® x BAx A CF? x B-£ 50. (3) 


Similarly, for subobjects of any object A of E there is an external lat- 
tice Sub(A), the set of all subobjects of A, to be contrasted with the 
internal lattice object PA. In Chapter IV, we developed these lattices 
together, but all the cited properties of the lattice object PA can be 
derived without any mention of Sub(A) or of any other set. Recall also 
from Chapter IV that the Beck-Chevalley condition has both an external 
form—for Sub(A)—and an internal one, for PA. 

Also, amap f: A — B yields by pullback both external and internal 
maps 

f—': Sub(B) — Sub(A), Pf: PB-PA,; 


both have left adjoints 3, (Proposition IV.6.3 and Theorem IV.9.2) as 
well as right adjoints V» (Propositions IV.9.3 and IV.9.4); the notation 
chosen does not distinguish the external Vy from the internal one. 

A set-based topos (i.e., one formed within a conventional set theory) 
must by definition satisfy all the axioms for a topos. Hence both the 
“internal” and the “external” constructions apply to such a topos. But 
when a topos is regarded as a possible foundation for mathematics, as 
in Chapter VI, only the internal concepts apply. 

In ordinary category theory, there is a related contrast between set- 
based definitions and diagrammatic ones. Thus a product A x B may 
be described by giving a bijection of sets 


Hom(X, A x B) ¥ Hom(X, A) x Hom(X, B), 


natural in X or equivalently by the usual universal property of the pro- 
jections. Many other notions (“monomorphism”, “projective object”, 
etc.) have equivalent pairs of definitions. 

In the category of sets a one-point set is a terminal object 1, so that 
an arrow z: 1 — A corresponds exactly to an element of the set A; 
namely, the image of the one point of 1. In any category with a terminal 
object 1 an arrow z: 1 — A is called (as for sheaves) a global element 
of A. In Sets, two parallel arrows f, g: A= B are equal iff fr = gz 
for every global element z of A. This need not be the case in many 
other topoi; when it is true, one says that the topos is well-pointed. This 
can be put more vividly: when a topos is well-pointed, a diagram there 
with two “legs” starting at an object A is commutative iff it commutes 
for every global element of A. In a general topos global elements are 
not “enough”--for example they are not enough to test diagrams for 
commutativity. Thus an arrow z: X — A from any object X is called a 
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generalized element of A or an X-based element or an element defined 
over X. The use of such generalized elements in a topos € supports the 
intention that working in a topos is like working with sets. 

For example, consider the “internal” composition defined in (3). 
With generalized X-based elements 


g:X>C?, f:X5B4, a: XA, 
this formula can be written in the familiar way as 


(go f)oa=go(foa), (4) 


where f oa, “evaluate f at a”, is just the composite e( f,a), while e is 
evaluation and ( f,a): X — B4 x A is the usual map to the product 
with components f and a. In other words, the same formula (4) with 
(generalized) elements defines both the internal composite go f and the 
usual external composition gf. 

The formula (4) in generalized elements defined over X is natural in 
X—as are all appropriate formulas in generalized elements. Because of 
this, to define a map ¢: D — E it is enough to propose a “composite” 
pox: X — E defined for all X-based elements z of D which is natural in 
the object X. The formula for ¢ oz then (for example) gives ¢ directly 
by taking X = D and x = 1p, and by naturality the “composite” is 
then an actual composite. 


6. Group Actions 


Let € be a category with finite products. Recall that a group object 
(that is, an internal group) in € is an object G of € equipped with three 
arrows e: 1 — G (the unit), m: GxG — G (the product), andi: G > G 
(the inverse) such that the usual axioms for a group—as expressed by 
diagrams—hold. It follows that composition with e, m, and i gives 
each hom-set Home(X,G) the structure of a group, natural in X, in 
the obvious way. Conversely, a group structure on Home(X, G) for each 
object X of €, natural in X gives G the structure of an internal group, 
by a standard application of the Yoneda lemma. 

If morphisms f, g: X — G are regarded as generalized elements 
of the group G (defined over X), one can multiply such generalized 
elements to get a composite 


fg=mo(f,g):X 3GxGoG, 


or an inverse t , 
fllstof:X +4G—4G. 
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A (left) action of G on an object A of € is a morphism p = pa: Gx A > 
A such that both diagrams 


TxA" SGA GxGxA—“4,GxA 
ee rf wn cE (1) 
A Gx A——,——> A 


commute. One can express this equivalently by the familiar identities 


f-(g-a)=(f-9)-, e€:a=a, (2) 


where f and g: X — G are now generalized elements of G (defined over 
any X in €), and a: X — A is a generalized element of A; then g-a 
stands for the composition 


XY9O,6Gx% ALA 


1 
and e in (2) is interpreted as the generalized element X — 1 = G. 

As before, giving an action by G on A in € is equivalent to giving for 
each X in € an action of the group Hom¢(X, G) on the set Home(X, A), 
natural in X. 

If G acts on two objects A and A’ by pw and yp’, say, a G-map from 
A to A’ is a morphism ¢: A > A’ in € such that the diagram 


Gx A—**,.64x Al 


[} ° 


A———— A’ 


@ 


commutes. With generalized elements, this is the familiar equation 


o(g +2) = 9° pa) 


for all X in € and all a: X — A, g: X —G. In this way, one obtains a 
category E° of objects of € equipped with a left G-action, or briefly of 
G-objects of E. 

Just as for a group in Sets, the case of right actions can be treated 
in exactly the same way. Let G°P be the same group but with the 
multiplication taken in the reverse order, so that the product is m°? = 
mor, where 7: G X G — Gx G is the twist map. Then the category 
Ee” of left G°P-objects in E€ is equivalent to the category of objects A 
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from € equipped with a right G-action A x G — A. For the sake of 
definiteness, this section will use just left actions. 
The forgetful functor 


UEP SE, UlAB)=A, (4) 


has a left adjoint F', which sends an object E of E to the “free” G-object 
(Gx E,mx1:GxGxE—G«x E). In particular, U preserves limits. 
In fact, it is not difficult to show that U is monadic (cf. §IV.4). Define 
a monad (T, u,v) on E by TE = G x E and 


He=mx1:GxGxEoGxeE, 


1 
ve =ex1: EX 1xXxE——-GxE. 


Then a T-algebra for this monad is exactly the same thing as a left G- 
object, and F 4 U is the familiar adjunction between the category €7 
of T-algebras and the “underlying” category €. 

In Chapter I, we observed that the category of G-sets forms a topos. 
The following result shows that this remains true when Sets is replaced 
by an arbitrary topos. 


Theorem 1. IfG is an internal group in a topos €, then the category 
E® of G-objects from € is again a topos. 


Proof: First, €© has all finite limits, and these are created by the 
forgetful functor U: E& — € (Proposition IV.4.1). For example, if G 
acts on objects A and B from €, then it acts on their product A x B by 
the familiar rule 

g- (a,b) = (g-a,g-b). (5) 


When g, a, and b in this equation are interpreted as generalized elements 
g: X > Gand a, b: X — A, then the equation (5) indeed defines an 
action Gx (Ax B) = Ax B. 

Next we construct the exponential C? of two given G-objects B and 
C. Start with the exponential C? in €. In the case where € = Sets, an 
element t € C? is a function t: B — C, and the action of an element 
g € G on this ¢ is defined “by conjugation” [cf. Exercise I.5(b)], as 


(g- t)(b) = g- (t(g7* -)). (6) 
But taking g: X — G, t: X — C® and b: X — B to be generalized 
elements defined over some object X of €, formula (6) defines a map 


GxC® x BC in €, and, hence, by transposition a map 


Gxc? _.¢c8, (7) 
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The straightforward proof that this map (7) is a valid action in Sets 
translates (via generalized elements) to a proof of the same result in €. 
The proof that (_ )* is then right adjoint to — x B is now done with 
generalized elements, copying the usual bijection Homg(A x B,C) = 
Homg(A, C®) for Sets. 

Finally, let us show how to construct a subobject classifier for €°. 
For this, take the subobject classifier true: 1 = 9 in € and give both 1 
and © the trivial G-action. Then true: 1 — 0 is a G-map, and we claim 
that this mono is the subobject classifier in €%. First observe that for 
any object (E, u) € E®, the following triangle commutes, where @ is the 
isomorphism 6({g,e) = (9,9 +e) (generalized elements) 


Gx E~——_4—__+GxE 


iN A (8) 


E. 


Now consider a mono m: D = E in E€°%; since U in (4) is a right adjoint, 
this m is also a mono in €, so it has a classifying map x there. Consider 
the commutative diagram 


CCD 2s OS Da ih 


ven] [pm "| fie 


Gx E—,—>+G x E—,,> E— 0. 

The left-hand square has both horizontal maps isomorphisms, so is a 
pullback in €. So are both other squares, while 726 = juz by (8) above. 
Hence in € both y72 and xp are characteristic maps for the mono 
1xm:Gx D—G «x E. Therefore they are equal. With generalized 
elements, this means that x(e) = x(g-e); in other words x is a G-map 
into the object 2 taken with the trivial G-action. Therefore y is the 
desired characteristic map of the G-subobject D — E; its uniqueness is 
immediate. This completes the proof that €© is a topos when € is. 


7. Category Actions 


Let E€ be a category with pullbacks. We shall describe the notion 
of a category object, or internal category, in €. Just as an ordinary 
small category consists of a set of objects and a set of morphisms, so 
an internal category C in E consists of two objects of E—an “object 
of objects” Co and an “object of morphisms” C,, together with four 
arrows of €, an arrow m for composition, as in (3), and three arrows 


d, 
626). “CpG. Cp > i (1) 
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for domain do, codomain dj, and identities e; with the first two, one de- 
fines the object Cz of “composable pairs” of morphisms as the pullback 


Co =C, XCo C, ay C, 


n| [a (2) 


Cc, —;— Co. 


Indeed, a generalized element h: X —» C; Xq, Ci is thus just a pair of 
such elements f, g: X — C, with dof = d,g, that is, “a composable 
pair”. One now requires, in addition to the morphisms in (1), a fourth 
morphism in € 


m: Co =C, xe, Ci — Cy (3) 


to represent composition of composable pairs. The axioms for an internal 
category then require, besides the usual identities dge = d,e = 1 and 
dom = dom2, dim = d,7™, commutativity of the following two diagrams 
which express the associative law and the unit law for composition: 


1xm 
C) XCo C, XCo C, —-——> C) XCo Cc, 


wigs | Ir 


C, XCo C, Sa Cy, 
(4) 


1xXe exl 
Ci Xo, Co——— Ci Xa, C1 +—— _ Co Xe, Ci 


C). 


These conditions (1)-(4) thus constitute a “diagrammatic” form of the 
standard definition of a category. If C = (C,,Co,e,do,d,,m) and D 
are two such internal categories, then an internal functor F: C > D is 
defined to be a pair of morphisms Fy: Co — Do and F,: C, — D, in E 
making the obvious four squares (with e, do, di, m) commute. With the 
evident composition of such functors one has a category Cat(€) with the 
internal categories in € as objects, and internal functors as morphisms. 

The definition above ensures that for an internal category C in € the 
Hom-sets Homg(X, Co) and Home(X, C;) for each object X of € form 
the collections of objects and morphisms of an ordinary ( “external” ) 
category Home(X, C) in Sets and this construction is “natural” in X. 
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Similarly, if F: C — D is an internal functor, then the pair of functions 
Home(X, Fp) and Hom¢(X, F) form the object and morphism parts of 
an ordinary functor Home(X,C) — Home(X,D). Furthermore, given 
two internal functors F' and G: C — D, one may readily define an 
internal natural transformation as a suitable arrow 0: Co — D, in €. 

This definition includes several familiar cases. Thus a category 
object in € for which Co is the terminal object is the same thing 
as a monoid object in €; and a category object C in € for which 
(do,di): C, —» Co x Co is monic is the same thing as a preordered 
object in €. 

Each internal category C determines in an evident way an oppo- 
site internal category C°P. This provides for the description of internal 
“contravariant” functors. 

In ordinary category theory, the functors F': C — D between two 
small categories play a role quite different from functors from C into 
the ambient category-—the category of sets. A functor of the latter sort 
consists as usual of an “object function” Cg — Sets and an “arrow 
function” C, —» Functions, suitably related. In this description, we 
now wish to replace Sets by any category € with pullbacks, and C by 
an internal category (again called C) in €. However, we have no such 
thing as an object function Co — €—arrows to “the universe” are not 
provided for us! In order to get a suitable “internal” description of such 
functors to the universe €, we first reformulate the usual case where 
the universe is Sets. There an object function Fo: Cy — Sets can be 
viewed as a Co-indexed family of sets, one for each A € Co. Just as in 
the treatment of indexed sets [§I.1(8)], this Co-indexed family can be 
replaced by a single object over Co, 


nr: F + Co, 


where F = |] Fo(A) is the disjoint sum of all the sets FyA, and 
AEC) 

m is the obvious projection. Each set FyA can then be recovered (up 
to isomorphism) from 7 as the fiber 1—1(A). Similarly, for the arrow 
function, each arrow f: A > B in C gives a map FoA —> FoB of sets, 
written for a € Fy A as a+ f-a. All these maps, one for each f € Cy, 
can be described in terms of 7: F' — Co as one single map specifying 
the action of any f on any a as 


uiC,xoq,F>F,  wlf,e)=f-a, 


where C, Xc, F -> F is the pullback of 7 along dg: C, — Co. 

By writing down the appropriate diagrams, the preceding description 
of a functor to Sets can be easily generalized to the case of an internal 
category C = (Ci, Co, e,do,di,m) in a category € with pullbacks. A 
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(left) C-object in € (also called an “internal diagram” on C) is an object 
x: F — Co over Co equipped with an action 


u:C, xo, FoF (5) 


of C on F, where for this pullback dp: C; —> Co is used to make C, an 
object over Co. Here the following diagrams are required to commute: 


Cy xq, F —*— F Co Xe FS G xe, F 
=| | — | 
C, + ae Co, F, 
(6) 
C, XCo C, XCo F Bele oF XCo F 
mx | | 
Cixg.f SS 
(The second and third express the unit and associativity laws for the 


action.) 

In terms of generalized elements, this can be expressed as follows: a 
generalized element of C; xc, F over X is simply a pair of maps f: X > 
C,, a: X + F such that ra = dof; write f - a for the composition 
u-(f,a). Then the commutativity of the three diagrams (6) above can 
be expressed by the familiar identities 


mf = a) = dy f, 
e(7a)-a=a, 
g:(f-a)=(gof)-a, 
for all a: X — F and f, g: X + C, with dof = ma, dif = dog; here 
g°f =m(qg,f) is the composite of the generalized elements g and f. 
If F = (F,x, «) and G = (G, 7’, p’) are two such left C-objects in E, a 
morphism of C-objects from F to G is simply a morphism ¢: F ~ Gin € 
which preserves the structure involved. In terms of generalized elements, 
this means that the usual identities 7(a) = 2'(¢(a)) and ¢(f-a) = f-d(a) 
are required to hold, for all generalized elements a of F and f of C; (such 


that f - a makes sense, i.e., dof = wa). In terms of diagrams, it means 
that 


pe SoG Ci Mea =e eee 
KY, fe 7 | (7) 
Co 


fF ———__—_+G 


@ 
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are required to commute. In this way, we obtain the category E© of all 
the left C-objects in €. 

If € = Sets, this clearly is just the familiar category of functors from 
C to Sets (up to equivalence of categories). In the special case when 
C is a group G in € (ie., Co = 1 and there is an inverse C; — C,), 
then E© is the category E% of G-objects, as considered in the previous 
section. 

As in the case of groups, one might also wish to consider the category 
of right C-objects of €, for an internal category C in €. These are objects 
aw: F — Co with an action yw: F xq, Ci — F on the right (the pullback 
F xe, C, is now along d,: C; — Co), defined in the obvious way. In 
the case where € = Sets, a right C-object is (up to isomorphism) the 
same thing as a presheaf on C (by identifying Cp-indexed collections of 
sets with sets over Cp, as above). In general, there is an equivalence 
between the category of right C-objects in € and left C°P-objects in €. 
Therefore, we may and shall restrict our attention to left C-actions. 

There is an obvious forgetful functor U: EC — €/Co, sending a left 
C-object (F, 7, 4) to the object 7: F — Co over Co. Eventually, our 
aim is to prove the following theorem. However, we shall take a rather 
roundabout route, and only achieve this goal at the end of §8. 


Theorem 1. Let € be a topos. Then for any internal category C 
in E, the category E© of left C-objects is again a topos. Moreover, the 
forgetful functor EC > E/Cpo has both a left and a right adjoint. 


By taking C°P instead of C, this theorem also states that the cate- 
gory of internal presheaves is a topos, generalizing the result of §1.6 for 
functor categories from Sets to an arbitrary topos €. 

To get an idea of what the left and right adjoint mentioned in The- 
orem 1 might be, let us consider some simple examples. 

As a first case, let G be a group in Sets, and consider the forgetful 
functor 

U: Sets? — Sets 


which sends a left G-set (A, u: G x A — A) to its “underlying set” A. 
This functor has a well-known left adjoint F’, sending each set X to the 
product G x X with the usual “free” actionm x1:GxGx X >~>GxX 
defined as in §6(4). But the functor U also has a right adjoint R, sending 
a set Y to the set Y of all functions from G to Y, with left G-action 
GxY® —Y® defined, for any g € G and g€ Y%, by 


(g9-g)(h)=o(hg) = (all hE G). (8) 
Since 9: (g2¢) = (9192), this is indeed a left G-action. That R is right 


adjoint to U can be seen explicitly by the bijective correspondence, for 
a G-set A and a set Y, 


Hom(A, Y) & Homg(A, Y%) 


7. Category Actions 245 


which sends an arbitrary function h: A > Y to the G-map h: A>Y¢ 
defined by h(a)(g) = A(ga). [Its inverse sends a G-map u: A — Y® to 
the function w: A — Y given by a(a) = u(a)(e), where e is the unit of 
G\] 

It is not difficult to generalize this construction of the right adjoint 
to the context of §6, and thus to show that U: €% — € has a right 
adjoint for an internal group G in an arbitrary topos € (Exercise 7). 

Next consider the case of a category C in Sets, with the forgetful 
functor 

U: Sets© — Sets (9) 


which sends a functor F': C — Sets to its “object-part” Fy: Co — Sets. 
[Recall that Sets©? is equivalent to Sets/Co, so that (9) is a spe 
cial instance of the forgetful functor U in Theorem 1 above.] The 
left adjoint for (9) is constructed much as in the case of groups. 
Given a function X: Co — Sets, i.e., a Co-indexed family of sets, 
let F = Fx: C — Sets be the functor defined by F(C) = {(g,2) | 
g € Cy, dig =C, and x € X(dog) }; for f: C > C’, the corresponding 
function F(f): F(C) — F(C’) is given by the formula F(f)(g,2) = 
(f og,2). We leave it to the reader to check that X — Fy is the left 
adjoint of U. (A more abstract description of the left adjoint F' occurs in 
the proof of Theorem 2 below.) To define the right adjoint R, suppose 
we already know that R exists. Then for Y: Co — Sets, R(Y) is a 
functor while the Yoneda lemma and the adjunction U + R give 


R(Y)(C) = Homgersc(C(C, —), R(Y)) 
= Homsets/c, (U(C(C, = )); Y). 


But then, we can drop the assumption of the existence of R, and use the 
second line of (10) as a definition of R. To prove that the functor R thus 
defined is indeed left adjoint to U, consider any functor X € Sets©. We 
wish to show that 


Homgets¢ (X, R(Y)) = Homsets/Co (U(X), Y). (11) 


For X a representable functor, i-e., X = C(C, —) for some C’ € Cp, this 
follows from the Yoneda lemma and the definition (10) of R. But any 
functor X € Sets© is a colimit of representables (Proposition 1.5.1), and 
U clearly preserves colimits. So (11) holds for general X. 

Now recall (§IV.4) that for a monoid M in Sets the forgetful functor 
BM — Sets from the category of left M-actions is monadic, and that 
the category of algebras for the resulting monad in Sets is equivalent to 
(actually is isomorphic to) the category BM. Exactly the same result 
holds when the monad M in Sets is replaced by an internal category 
C in a topos. The proof is essentially the same, except that arguments 
with elements are now replaced by diagrams. To begin with, consider 
an arbitrary category € with pullbacks. 


(10) 
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Theorem 2. Fora category object C in a category € with pullbacks, 
the forgetful functor U: EC — E/Co has a left adjoint F, and U is 
monadic; that is, the category E© of left C-objects is equivalent to 
(actually, isomorphic to) the category of algebras for the monad T = UF 
on €/Co (cf. §IV.4). Moreover, if € is a topos, then the functor U has a 
right adjoint. 

Proof: The forgetful functor U: EC —. E/Co assigns to each C- 
object (F, 7, 4) as in (5) and (6) the map 7: F —+ Co, considered as an 
object of the slice category E/Cp. Conversely, given an object k: X > 
Co of this slice category we first form the pullback 


C, xo, X —1— xX 


Cc, —,—— Co 
‘Oo 


and from this the object d, op: C, xc, X ~ C, — Co in E/Co, with 
the composition m of C giving the action as the map m x 1, as in the 
diagram 


Cj X63 (Ci XG XK) ee Ci eas X 


C, > Co. 


dy 


(This is just like the construction in §6 of the free action of a group G 
on the object G x A.) The rules for an action [as given in (6) above] are 
readily verified; therefore, the assignment 


F(h: X + Co) = (do op: C, Xo, X — C; — Co,m x 1) (12) 


defines a functor F: €/Cy — E©. It is left adjoint to the forgetful 
functor U. Explicitly, for k: X —» Co in €/Cy and Y = (Y,7, p) in ES, 
the correspondence 


Homec (FX, Y) = Home /c,(X, UY) 


is described as follows: from left to right, a given map of C-objects 
1 
a: FX =C xc, X - Y is sent to the composite X = Co Xo, X 5 


Ci xq, X ay Y; from right to left, a map 8: X — Y of objects over 
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Co is sent to the C-map C, xg, X — C; xq, Y “ Y. These 
correspondences are indeed mutually inverse. 

Moreover, the corresponding composite T = UF sends X —+ Co to 
Ci xc, X — Co, with the evident “multiplication” operation uy: T? + T 
given by the composition m of C,, and the “unit” 7: I — T given by the 
identities map Cp — C, of C. With these two natural transformations, 
T is a monad: the unit and associativity laws for T correspond exactly 
to these laws for C [cf. (4) above]. An algebra for this monad is thus 
a suitable map T(X — Co) — (X — Co) over E/Co, and from the 
definition of T it is easily seen that this is exactly the same as an action 
by C on X —+ Co. Consequently, the category E© of left C-objects is 
precisely the category (E/Co)7 of algebras for this monad. 

Finally, suppose € is a topos. The composite functor T = UF can 
then be written using (12) as a different composite of functors between 
slice categories, to wit a change of base followed by a composition ©, 


dy 2a, 
E/Co — E/C, —— E/Co, 
(X > Co) (C, XQp X 9 Ci) (Cy Xe, X > Co). 


By Theorem IV.7.2 for a topos, each of the two functors Lg, and dj has 
a right adjoint, and hence so does their composite T, as asserted in the 
theorem. 


8. The Topos of Coalgebras 


Another important construction of topoi uses the coalgebras for a 
comonad. We have already (§IV.4) considered monads in a category C. 
The dual notion is that of a comonad (or cotriple), which consists of an 
endofunctor G: C — C and two natural transformations 6: G-— GoG 
(“comultiplication”) and «: G — I (“counit”), where I is the identity 
functor on C, such that the following diagrams commute for each object 


C of C: 


CO 2 eG CC 
s| [se Lo ON (1) 
3 
CC aE GC, GC az- GC a GC. 


As for monads, if a functor F': B — C has a right adjoint R: C > B 
with unit 7: I — RF and counit «: FR — I, the composite endofunctor 
G = FR, with 6= FnR: FR —- FRFR and ¢ form a comonad on C. 
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Theorem 1 (Eilenberg-Moore, first part). IfT: A— A has a 
right adjoint G and T is also a monad for the natural transformations 


gn. IT and iit ST; (2) 


then its right adjoint is a comonad for the (unique) natural transforma- 
tions 
eG—oTI and 6:G—>G (3) 


determined by the requirement that they make the following diagram 
commute, for all objects A and B of A: 


1: A(A,B) == A(A, B) 
n| [- 
9: A(TA,B) = A(A,GB) (4) 


rf | 


@?: A(T?.A, B) = A(A,G?B), 


where 7, = A(na, B), etc., and 6 is the natural isomorphism given by 
the adjunction. 


For example, a monoid M in Sets determines the familiar monad T 
with T(X) = M x X for X aset. This functor T has a right adjoint G 
sending a set Y to G(Y) = Hom(M,Y). This functor G is a comonad 
with comultiplication the evident map 


Hom(M, Y) —» Hom(M x M,Y) = Hom(M, Hom(M, Y)). 


Proof: First notice that, by the Yoneda lemma, the commutativity 
of (4) for all A and B in A determines ¢g and 6g uniquely, and makes 
them natural in B. One may consider this uniquely determined 6g as a 
sort of “adjoint” natural transformation to 44, etc. The commutative 
diagram 


A(TA,B) 8 A(A, GB) 


rl | 


A(T?A, B) —2— A(TA, GB) —°—. A(A, G?B) 


ral | |(com. 


A(T? A, B) A(TA, G?B) A(A, G°B) 


6? 6 
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shows that 4° wra has such an adjoint Gég © 4g; in this diagram the 
top rectangle and the lower left-hand square present the definition of 6z, 
while the lower right-hand square commutes by naturality of 0. A similar 
diagram shows that 44°T py, has adjoint 6gp 06g. The associative law 
for » (the equality of the left composites in these two diagrams) then 
gives, with Yoneda, the equality of the right composites; that is, the 
“coassociative” law for the comonad: 


6ap 0 6p = Gép obp: GB > GB. (5) 


Exactly comparable diagrams, using the definition of €g, yield the other 
equation required for a comonad 


€gp 0 bp =1= Gep obp: GB — GB. (6) 


This completes the proof. 

A dual argument will show that a comonad structure on G will yield 
a monad structure on its left adjoint T. 

Now recall that a T-algebra for the monad T consists of an object 
A and an arrow h: TA — A such that 


l=hona: A-TA-A, hoTh=hopa:T’7A>A (7) 


(see §IV.4). The category A” of all such T-algebras has a forgetful func- 
tor U: A? — A, sending a T-algebra (A,h) to the object A; moreover, 
this functor U has a left adjoint F', sending an object X of A to the 
“free T-algebra” FA = (TA, 4: T?A > TA). 

Dually, for a comonad (G, 6,¢) on A, a G-coalgebra is an object B 
of A, equipped with a “structure map” k: B — GB such that 


l=epgok: B—>GB-B, Gkok=6gok:B—G’B. (8) 


With the obvious notion of morphism, this gives a category Ag of all G- 
coalgebras. Again, there is a forgetful functor V: Ag — A, V(B,k) = 
B; this time, V has a right adjoint R, which sends an object X of A 
to the corresponding “cofree coalgebra” (GX,6x: GX — G?X). The 
theorem above now continues as follows: 


Theorem 2 (Eilenberg-Moore, continued). Under the hy- 
potheses of Theorem 1, there is an isomorphism L from T-algebras to 
G-coalgebras, which commutes with the respective forgetful functors, as 


in the diagram 
~~ A (9) 
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Again take the example of a monoid M in Sets with the associated 
monad T with T(X) = MxY and comonad G with G(Y) = Hom(M, Y). 
A left action of the monoid M on a set X can be represented as the 
structure map h: T(X) — X of a T-algebra or by its transpose X — 
Hom(M, X), which is the structure map for the G-coalgebra X. 


Proof: Let L send the T-algebra (A,h: TA — A) to its transposed 
(A,k: A — GA) under the adjunction isomorphism 6: A(TA,A) ~ 
A(A, GA). Clearly, this commutes with the underlying functors U and 
V. As for the identities defining algebras and coalgebras, the two com- 
muting diagrams in (10) below (left arrows together, right arrows to- 
gether) show that the associative law holds for A iff the corresponding 
coassociative law holds for its transposed k: 


A(A, A) === A(A, A) 
A(T A, A) —*—> A(A, GA) (10) 
Ha} [erm bas || (Gk) 
A(T? A, A) a AA, G’A). 


The argument for the other identity is similar. Clearly, Z is an isomor- 
phism. 


Corollary 3. If (7,7, ) is a monad on the category A while G is 
a right adjoint to the functor T: A — A, then the forgetful functor 
U: A? = A has both a left and a right adjoint. 


Proof: Since by (9) we can identify the forgetful functors U and V, 
the known right adjoint R for V serves also as a right adjoint for U. In 
terms of T-algebras (apply the functor L~'), this right adjoint sends an 
object A to the algebra 


(GA, TGA 74, TG?A “24, GA), (11) 


where e’: TG — TI is the counit of the given adjunction. (Indeed, without 
using coalgebras one can verify directly that this T-algebra TGA + GA 
provides a right adjoint to the forgetful functor U from T-algebras.) Note 
also that the comonad on A, obtained from the adjunction between U 
and its right adjoint, is exactly the original comonad (G, 6, €). 

Notice that the two adjoints for U (which by Theorem 2 is also the 
C-object forgetful functor) whose existence was asserted in Theorem 7.1 
have now been proved to exist, by Theorem 7.2 and Corollary 3 above. 
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We now intend to show that the algebras for a monad in a topos 
E will—under suitable hypotheses—themselves form a topos; curiously 
enough, we must first prove the corresponding result for coalgebras, as 
follows. 


Theorem 4. If (G,6,€) is a comonad on a topos € for which the 
functor G is left exact, then the category Eg of coalgebras for the 
comonad (G, 6, €) is itself a topos. 

Proof: First we observe that the category Eg of coalgebras has finite 
limits. For any functor G, the projections of the product A x B on its 
factors induce a map 


@= ba,B: G(A x B) — GA x GB, (12) 


natural in the objects A and B. The hypothesis that G is left exact, 
i.e., preserves finite limits, simply means that this map ¢ (and similar 
maps for all other finite limits) is an isomorphism. It follows readily that 
for any two coalgebras (A, s) and (B,t), one may construct the product 
coalgebra by providing the product A x B with the structure map 


Ax B-%!3GA x GB + G(A x B). (13) 


The case of other finite limits (terminal object and equalizers suffice) is 
treated similarly. 

Next we construct exponentials in Eg, using the given exponentials 
in €. To this end, consider coalgebras (A, s), (B,t), and (C,u) and the 
right adjoint R: € — Eg which sends each A to the cofree coalgebra 
GA — G?A. By the given adjunctions there are isomorphisms 


Homg(A x B,C) & Home(A, C”) & Home, ((A,s), R(C”)), (14) 
sending f: A x B —+ C into f’ and then f”, where 
f" =Gf'os, f=eo(f’ x1), (15) 
and e: C? x B -+ C is the evaluation map for the exponential in €. We 
hope to get a corresponding transpose for the following set [with ¢~! as 
in (13)]: 
Home, ((A x B, ¢-*(s x t)), (C, u)) 
={f:AxB—C, acoalgebra map}. (16) 
Now amap f: Ax B = C'is acoalgebra map when the following diagram 
in € commutes: 
Ax B—24GA x GB —*"_, G(A x B) 


| Jer (17) 


CSS are: 
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We wish to translate this condition into a condition on the maps f’ and 
f" of (15) above. To do this, we will take the exponential transpose 
(in €) of each leg of the diagram (17). First observe, by (15), that the 
top leg of (17) becomes the top row and then the bottom row of the 
commutative diagram 


Ax B—1,GAx B+. GAx GB —*~ G(A x B) “4 — GC 
em | fen “4 
A x Bap G(C") x B =p G(C®) x GB =» G(C* x B) > GC. 

(18) 


Now, denote the exponential transpose of the last factor G(e) 0 g7! 
by 
pe G(C?y 3 (ec), (19) 


By naturality of transposition, the top leg of (17), i.e., the bottom row 
of (18), now has as exponential transpose 


Af", G(c8) # (aoye® © (acs. (20) 


On the other hand, the bottom leg of (17) has as transpose 


YT) jae “(GO (21) 


We wish to describe those f: Ax B > C (ie., those f’: A > C®) which 
make these maps in € equal. To do this, we transpose each of these maps 
[(20) and (21)] along the adjunction between € and Eg. If (D,k) is any 
coalgebra, this adjunction sends an arrow h: V(D,k) = D— X in E to 
h# = Ghok in Eg and in particular sends f’ to f” = Gf'os. Therefore 
the first map (20) is transposed to the composite G((GC)')oGp-Gf" os. 
But f” is a coalgebra map, so Gf” os = gz 0 f”. The transpose of (20) 
is thus the map 


At a(c®) 5 Gato?) = 


G((GC)’) 
———> 


G((GC)°*) G((GC)*), (22) 


while the second one (21) becomes the map 


a a Gicry = G((GC)*). (23) 
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These last two maps are maps of coalgebras; and in these composites (22) 
and (23), all the objects except the initial object A are cofree coalgebras 
(with structure map some appropriate component of 6, as for any cofree 
coalgebra); while all morphisms, except for the initial f”, are maps of 
such cofree algebras. Hence to make (17) commute we are looking for 
those maps f” which equalize these two arrows in Eg: 


G(c?) “)_, a(acy®) 
| Jeweory (24) 
G(G(C?)) aq O((EC)**). 


But we already know that the category Eg has finite limits, and hence, 
in particular, has equalizers. Therefore, the equalizer in Eg of these two 
arrows, which exists and depends just on the two coalgebras (B,t) and 
(Cu), is the desired exponential in Eg 


(C, u) FB), 
Finally, we construct the subobject classifier in the category Eg. 
First, two little lemmas: 


Lemma 5. If (A,s) is a G-coalgebra and m: D»— A is a subobject 
of A in €, then there is at most one map d: D — GD for which sm = 
(Gm)d; in particular, there is at most one coalgebra structure on D for 
which m is a morphism of coalgebras. 


Proof: The hypothesis gives a commutative diagram 


D—2—= GD 
A—.— GA. 


But G is left exact, so preserves monomorphisms. So Gm is again monic, 
and a map d as in (25), if it exists at all, is unique. If d is a coalgebra 
structure map, the commutativity of (25) states that m is a morphism 
of coalgebras. 


Lemma 6. If in the commutative diagram (25), m is mono and 
(A, s) is a coalgebra, then (D, d) is also a coalgebra and the square is a 
pullback in E. 
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Proof: Consider the diagram 


d | |e Gd (26) 


5A 
om Nee 


G’D. 


GD 7 
The inside square commutes because A is a coalgebra. The bottom 
trapezoid commutes because 6 is natural. The other three trapezoids 
commute by the given commutativity of (25). Since m, and hence Gm 
and G?m, are mono, the outside square of (26) commutes, showing the 
coassociative law for d. The counit law follows from the diagram 


pp -—* 6p == p 


[oR fo 


Here both squares commute, and the bottom composite then is the iden- 
tity because (A, s) is a coalgebra. Since m is mono, the top composite 
is also the identity. Moreover, from this diagram it follows easily that 
the left-hand square is a pullback, as asserted. 

To construct the desired subobject classifier in Eg, start with the 
subobject classifier true: 1 2 in €, and construct first in € the classi- 
fying map r: GQ — 2 of the mono G(true): G1 GQ. Now consider 
any sub-coalgebra m: (D,d) (A, s) in Eg, and denote by h: A > 2 
the classifying map of m: D— A in €. In the commutative diagram 
(with ! any unique map to 1) 


p24, ep) ej i 


>| [om le true fs (28) 


A—,— GA —aa_ GN —_,— 2, 
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the right-hand square is a pullback by definition of 7, the middle is a 
pullback since h classifies m: D»— A and G preserves pullbacks, and the 
left-hand square is a pullback by the last lemma above. So the entire 
rectangle is a pullback, and therefore r o Gh o s is also a characteristic 
map for m, i.e., 

h=7T0Ghos: A>. 


But (A, s) is a coalgebra so Gsos = 6,408, while 6 is natural. Therefore, 
Ghos=GroG*hoGsos=Gro6é90Ghos, 


so that the map Gho s of coalgebras factors through the equalizer—call 
it Ne—in Eg of 1 and Gro dp, as displayed in 


Grof 
Q —— GN ==} GA, (29) 


where GQ stands for the cofree coalgebra on (2. This factorization 
holds for any coalgebra (A,s) and any subcoalgebra D. But trivially, 


the object 1 with unique structure map 1 —+ Glisa G-coalgebra, 
and 1-1 is a sub-G-coalgebra with characteristic map true: 1 = 2 
in €. Therefore, as a special case of this factorization, we have that 
G(true): 1 ¥ Gl — GO also factors through this equalizer e, say as 
G(true) = eotg: 


te 
1— Ne, (30) 


as in the right-hand square in (31) below. Also the map Gho s of coal- 
gebras factors through Ng by a map k as in (31), while the rectangle, 
obtained by composing the two left-hand squares in (28), gives the com- 
mutative diagram 


D——— 1 ———- G1 


“| Ie [ese 


AE Oe == 4 60. 


Ghos 


Moreover, since the original rectangle [= the top rectangle in (31)] is 
a pullback, as is the right-hand square, so is the left-hand square in 
(31). We claim that this makes tg: 1 — Qg a subobject classifier for 
the category Eg of coalgebras. For this, it remains only to show the 
uniqueness of the classifying map k. So suppose ¢: A > Q¢ is any 
other map of coalgebras for which the left-hand square in (31), with 
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k replaced by ¢, is a pullback in Eg. Since e is monic, to prove that 
@ = k, it is enough to show that ed = Gho s, as maps of coalgebras. 
Transposing along the adjunction U: Eg €: R, this is equivalent to 


h=€no0e0¢g (32) 
in €. But 


en0eog=eQ0Grobgoed [by (29)] 
= T0€GE 0 bp Oed (naturality of ¢) 
= Ted [unit law (1)]. 


Now consider the commutative diagram in €: 


D——— | ——— - G1 ——— 1 
"| he [ese | 
A—7— NG > GD —,— 2. 


The left-hand square is a pullback since ¢ classifies m in Eg and the 
forgetful functor €g — E preserves pullbacks (by the construction of 
limits in Eg, as in the beginning of the proof), the middle square is 
trivially a pullback (any square with an iso on top and a mono on the 
bottom is), and the right-hand square is a pullback by definition of r. So 
the entire rectangle is a pullback in €, and, therefore, re@ must be equal 
to the classifying map h for D "A. This proves (32) and so completes 
the proof that Eg is a topos. 


Combining this result with the previous isomorphism between T- 
algebras and G-coalgebras (Theorem 2), we obtain: 


Corollary 7. If (T,7, 4) is a monad on a topos € and T has a right 
adjoint, then the category €7 of T-algebras is again a topos. 


In particular, if C is an internal category in a topos €, the category 
ES of left C-objects is isomorphic by Theorem 7.2 to the category ET 
of T-algebras for a monad T = UF, where U and hence UF has a right 
adjoint. Hence, by the last corollary, E© is a topos. So, finally, we have 
also completed the proof of Theorem 7.1. 


9. The Filter-Quotient Construction 

In a given topos € the Heyting algebra Sub 1 may be “too large”; one 
might want it to consist instead of just two elements 0 and 1, in which 
case € is said to be “two-valued”. Now a too-large Boolean algebra B can 
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be collapsed by dividing it by an ideal or even by a maximal (prime) ideal 
m; the quotient B/m is then a two-element Boolean algebra. Instead of 
using ideals one may use the dual objects, the filters. The corresponding 
reduction will apply not just to Subl in €, but to the whole topos; it 
provides a “filter-quotient” construction. This will be needed in the next 
chapter for the reduction of certain models of set theory. 

We first consider the case of Heyting algebras. A homomorphism 
0: H — K of Heyting algebras H and K is a function which preserves 
all the operations involved (0, 1, A, V, and =). For any such homomor- 
phism 0 the inverse image 0~'(1) of the top element of the codomain K 
is a filter U in H; that is, a subset U C A such that 


l1eU, 0¢U (1) 
v>ueéU implies v € U, (2) 
u,v € U implies uAv € U. (3) 


Conversely, we will prove 


Proposition 1. For any filter U in a Heyting algebra H there is an 
epimorphism 0: H — K onto a Heyting algebra K with 6~1(1) = U. 
The Heyting algebra K is uniquely determined up to isomorphism by H 
and this property. 


Proof: We first consider the case of a principal filter U = Tu, given 
by some element u € H as the set { u' | u’ > wu}. An equivalence relation 
= on H is then defined by requiring that a = b iffaAu=bAu. Let 
a, be the resulting equivalence class of an element a € H, and define a 
partial order of these equivalence classes by 


Oy, <b, iff aNu<bAu. 


On the resulting poset H/u of these equivalence classes, meet and join 
are then given by the formulas 


Ou Abu = (aA d)u, dy V by = (a V d)u, 


so the poset H/u is a lattice with evident top and bottom elements 1 
and 0. Moreover, there is an implication operation defined by 


bu > Cy = [b > clu. 


This has the requisite property (it is right adjoint to A) because a, Aby < 
cy iffaAbAuscAuin H iffaAbAu<c, that is, aANu<(b>c)Au 
and hence ay, < (bu => Cu). Thus we have constructed a Heyting algebra 
H/u and an epimorphism H — H/u with the given filter Tu as inverse 
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image of 1. Moreover, if w < uv < u are elements of H, there are evident 
homomorphisms 


Puy: H/u— H/v, Pow: H/v > H/w (4) 


with composite puw: H/u— H/w. 

The epimorphism 0: H — H/u can also be described without using 
equivalence classes by taking H/u to be the “down-segment” |w consist- 
ing of all a € H with a < u, with partial order, meet and join induced 
from H, while @ is given by 0(a) = aAu. 

Now consider an arbitrary filter V on H and define a congruence 
relation on H by 


a=b6 (mod U) iff there exists uc U withaANu=bAu. (5) 
The set H/U of equivalence classes is then exactly the colimit 


H/U = lim _, H/w (6) 


of the homomorphisms (4), and H/U is a Heyting algebra when the 
operations on the equivalence classes are defined in the familiar way, so 
that H/U is the colimit (6) in the category of Heyting algebras. Thus, 
an element of H/U is an element of some H/u; elements a, of H/u and 
by of H/v are equal in H/U iff there is aw <uAv with puw@ = Dowd, 
and the Heyting algebra operations on a,, and b, are performed in such 
H/w. Thus, for example, 6, = cy is [b > c]y for w as above. This 
construction proves the proposition, and indicates why the standard way 
of constructing an algebra of equivalence classes can here be formulated 
in terms of a direct limit of Heyting algebras. 

Next, consider the corresponding construction in a topos € for a filter 
U of open objects (i.e., of subobjects of 1 € €); here “filter” is defined 
exactly as in (1), (2), and (3) above. In the previous construction for 
Heyting algebras, the maximal element of each H/u was u; correspond- 
ingly, we wish to construct from € and each U — 1 a topos with U itself 
as the terminal object; this requirement suggests the usual slice category 
E/U, where 1: U — U is indeed the terminal object. 

For each U in the filter U/ this slice category €/U is also a topos, by 
Theorem IV.7.1, while pullback along an inclusion i: V — U of open 
objects yields a logical morphism E/U — €/V. The desired “filter- 
quotient” of E€ over U should then be constructed as the direct limit 
Es lim € /U. However, for two successive inclusions i and 7: W + V 
of open objects the composite of the corresponding chosen pullbacks, 


E/U ~+€/v 2+ €/w, (7) 
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is at best just isomorphic to the choice of the pullback (ji)* of the com- 
posite. In other words, the assignments U H €/U, i+ 7* is a functor 
only “up to isomorphism”, and so the usual construction of a colimit of 
the maps (7) does not apply directly. Under such circumstances, it is 
possible to construct a “weak” colimit of functors “up to isomorphism” ; 
instead we will replace each slice topos €/U by a smaller but equivalent 
subcategory €//U for which the pullbacks like (7) can be canonically so 
defined that the composite (7) holds “on the nose”. 

For this purpose, it is convenient to assume that the given topos 
E is equipped with a canonical choice of products X x Y with their 
projections, for all pairs of objects X and Y in €. (For example, there 
is available such choice of products in the category Sets and hence in 
related functor categories.) We can then take the desired category E//U 
to be that full subcategory of the slice category € /U which consists of all 
those objects of €/U of the form of a second projection 72: X x U = U 
for such a canonical product. Actually, it will be easier to regard the 
objects of €//U as just the objects X of the original €. 

Thus, take €//U to be the category with objects the objects X of € 
and with morphisms from X to Y the morphisms f: X x U +> Y x U 
over U in € (that is, morphisms commuting with the projections to 
U). There is then an equivalence of categories 0: E€//U — €/U sending 
X to m2: X x U — U and each arrow f to itself. The reverse map 
@: E/U > E//U is given by 


X ——__4 .y X x U —_ + (971,72) 
* a se S (8) 


Indeed, the evident natural transformations 2 = J and 66 = 1 are 
natural isomorphisms. The proof uses the fact that any two maps A, 
k: Y — U are necessarily equal, since U —1 is mono and 1 is termi- 
nal in €. Because of this equivalence of categories €//U = E€/U the 
former category, like the latter slice category, is a topos and 9, ¢ as 
equivalences preserve the topos structure up to isomorphism, so are log- 
ical morphisms. By the stated property of arrows to U, each morphism 
f: XxU —YxU over U in E//U is completely determined by its first 
projection mf: X xU 4Y. 

Now if i: V — U is an inclusion (of subobjects of 1 in the filter /) 
there are induced maps a = 1 x (i,1) and 8 = 1 x m2 


NEV PSX KU V SE (9) 


Since a map to V (or to U) from any one object is unique, it follows 
readily that Ga = 1 and af = 1. Therefore any map f: X > Y in 
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E//U, that is, a map f: X x U = Y x U over U in €, will determine a 
unique map 


a fx. B 
flV = puvf: XxV > XKUxV So YxKUxVOYxV (10) 


which is a map pyy f: X — Y in E€//V, also written above as f|V. One 
calculates immediately that pyy is a functor €//U — €//V. Moreover, 
puv is a logical functor because under the equivalence of E/U with 
€//U the functor pyy corresponds to the pullback functor i,: E/U > 
E/V, which is logical by Theorem IV.7.2. More is true; by (8) and the 
choice of objects X in €//U, the finite limits, exponents, and subobject 
classifiers there are identical with those in £€, and hence are preserved 
on the nose by € > €//U and by pyy, which is thus a strict logical 
functor. Furthermore, pyy f may be described as that map h: X x 
V — Y x V over V for which mh = mf(1 xi): XxV—Y. From 
this characterization of pyy one concludes that the composite inclusion 
WcVCU gives 


Puw = Pvw opuv: E//U > E//V > E//W. (11) 


Thus, these logical morphisms pyy also compose “on the nose”, as de- 
sired. Because of this one can construct the usual colimit of these cate- 
gories as the category 

Ess lim E//U. (12) 


The successive functors (11) are the identity on objects X, so the 
objects of €,. can be again taken to be the objects X of €. To complete 
the desired colimit category, one then takes each hom-set Hom..(X, Y) 
there to be just the ordinary colimit (in the category Sets) of the sets 
Homey (X,Y). This colimit is calculated, as usual in Sets, by tak- 
ing the disjoint union of all the hom-sets, with elements written as 
(U, fu: X x U = Y x U) and then factoring out by the equivalence 
relation = defined for U, V € U by 


(U, fu) =(V, fv) iff fulW = fulW 


for some W CUNV inU. Write [f] = [fu] for the equivalence class so 
defined for any such (U, fy). For successive morphisms fy: X x U > 
Y xU and gy: Y x V + Z x V over U and V a composite class is then 
defined by choosing some W C UNV in U (always possible since U/ is a 
filter) and then setting 


[9] © Lf] = [(glW) o (f|W)]. (13) 


It follows that the resulting class is independent of the choice of 
W CUNV and that €,, with these classes as morphisms is a category 
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with the same objects X as the category €. Moreover, Puco(f) = [f] is 
a functor 


Puc: E//U — Ex (14) 
and pyoPuUV = PUco- This completes the description of the colimit. 


Theorem 2. If U is a filter of subobjects of 1 in a topos €, then 
the filter-quotient E.. (or E€/U) constructed from € and U by the colimit 
(12) is a topos. For each object U of U the map 


is a logical morphism of topoi; in particular, for U = 1, E > Eg is 
logical. 


Proof: For each structure required in a topos we will construct the 
corresponding structure in €,, and show that the map pyo of (15) pre- 
serves that structure. 

To form a product X x Y in €,, we first observe that in €//U the 
product X x Y is given by the diagram 


XxU +72 Xx YxU—™@25Y xU 


| | | 


GC —_—_—_—_—————. J ————— _ 


For each V C U this maps by pyy to the corresponding product diagram 
in €//V and so yields in €,, a diagram of equivalence classes 


xml yyy mL y. (16) 


To show that these projections have the required universal property, 
consider in €,, any pair 


x [Ff] Z [9] Yy 


of maps to X and Y. They are both realized in some €//U by maps 


XxUc# ZxU-44YxU 


and determine there a unique h: Z x U = X x Y x U with m3h = f, 
To3h = g. Then [™3][h] = [f] and [723][h] = [9] for the diagram (16); a 
corresponding argument shows that this equivalence class [h] is unique 
with these components, as required for the universality of the product 
(16). 
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The construction of equalizers is similar. Given f, g in some €//U 
we construct their equalizer there in the form 


ExU—+XxU=3YxU. 


For any V C U, since pyy is logical, it follows that e|V is the equalizer 
of f|V and g|V. Thus, [f][e] = [gl[e] in €..; one also has that any 
[h]: Z > X in E with [f][h] = [g][A] factors at some suitable stage V 
through e|V and hence factors (and indeed uniquely) through [e]. Then 
[e] is the equalizer in Eo. 

Similarly, each pyy for V Cc U preserves exponentials in the sense 
that it carries each evaluation map in E//U 


ey: XY x Yo X 


into the corresponding ey for €//V. So, given objects X and Y of Ex, 
let (X¥)o. be the object X* and set, for some U in U, 


Coo = [ey]: XY x Yo X. 


Since puy is logical, this map will equal [ey] for any V C U. Its univer- 
sal property follows readily; thus py preserves exponentials and their 
evaluation maps. 


Before constructing the subobject classifier in €,, we first discuss 
monomorphisms there. 


Lemma 3. If f = fy: X x U + Y x U is an arrow of E//U, the 
equivalence class [f]: X —» Y in E.. is mono there iff there is some 
V CU for which f|V: X x V + Y x V is mono in €. 


Proof: For each V C U we may construct the kernel pair of f|V as 
the pullback (over V) 


PxV—+>+Xxv 


| yr 


XxXV—YxV; 


as always, f|V is mono iff p = q in this kernel pair. Now the functors 
Pyoo and pyy preserve pullbacks, so f|V mono for some V implies [f] 
monic in €,,. Conversely, if the equivalence class [f] is mono, then one 
must have [p] = [g] in Eo. and hence, for some V C U, piV = q|V, 
so that f|V is indeed already mono in some category €//V. In other 
words, f is eventually mono, as stated in the lemma. 
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The subobject classifier 0 and the attached “universal” mono true = 
t:1 +2 in € will yield a universal monotx1:1xU —-0xU over U in 
E//U, since € -+ E//U is logical. Then, given some mono [f]: X > Y 
in E,, there is by the lemma a suitable stage U at which f x U is a mono 
in €//U and hence has a classifying map x: Y x U + 2.x U over U 
there. The equivalence class [vy]: Y — 0 is then a classifying map for 
[f] in €o0 for the universal mono ¢ x 1. Thus the projection E//V — Eo, 
preserves the subobject classifier 9 “on the nose”. 

This concludes the proof that €.. = lim €//U is a topos, for any filter 
U of subobjects of 1 in €. One may show that the corresponding Heyting 
algebra Sub 1 in the topos €,. is indeed constructed as in Lemma 1 from 
the filter U/ in Sub 1. 


Exercises 


1. Let 7 be a Lawvere-Tierney topology on a topos €, with associ- 
ated object J as in §1. Show that J classifies dense subobjects, 
i.e., there is an isomorphism of posets DSub(X) = Hom(X, J), 
natural in X € € [where DSub(X) is the poset of dense subob- 
jects of X]. Also prove that D Sub(X) is a lattice. Is it a Heyting 
algebra? 

2. Consider Lawvere-Tierney topologies on a fixed topos €. 


(a) The set of topologies has a natural partial order inherited 
from the internal partial order on 2; so for topologies j, 
k:Q + Q, define 7 < k iff 7 = 7 Ak (where j AK is 


the composite 2 =e, axats Q). Prove that 7 < k 
iff ko 7 = k. Describe the partial order in terms of the 
corresponding closure operators, and also in terms of the 
corresponding dense subobjects. 

(b) For a fixed topology j, show that topologies on Sh; € cor- 
respond exactly to topologies k on € such that k > j. 

(c) Let C be asmall category. A Grothendieck topology K on 
C is said to be finer than another one J iff K(C) D J(C) 
for all C € C. Conclude that Lawvere-Tierney topologies 
on Sh(C, J) are the same thing as Grothendieck topologies 
on C which are finer than J. 


3. Let J be a Grothendieck topology on a small category C, 
and let j be the corresponding Lawvere-Tierney topology on 
Cc (cf. Theorem 1.2), with associated closure operator (-) 
(Proposition 1.1). Check that for a subpresheaf A of a given 


presheaf P its closure A is described simply by A(C) ={ze 
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P(C) | the set of all arrows f: D -+ Csuchthatz-f € 
A(D) form a J-cover of C }. 

4. Consider, for a topology 7 on a topos €, the subcategories Sh, € C 
Sep;& C€ €. First prove that the left adjoint € — Sep, & of 
Corollary 3.6 is left exact. Next, prove that if Sep; € is also a 
topos, then it must coincide with Sh; €. (Hint: prove that the 
subobject classifier of Sep; € must be a sheaf.) 

5. Let 7 be a Lawvere-Tierney topology on an elementary topos 
€, with a corresponding natural closure operator on subobject 
lattices A + A, and with an associated sheaf functor a: € 
Sh; € left adjoint to the inclusion 7: Sh, € > €. 


(a) Prove that if u: A —> B is a j-dense mono in €, then so is 
lxu: Ex A— Ex B, for any object E of E. 

(b) Deduce from (a) that an object E of E is a j-sheaf iff for 
any j-dense mono wu: A — B, the map E": E? — EA is 
an isomorphism. 

(c) Prove that for objects E and F of €, if F is a j-sheaf, then 
the unit 7: E — ia(E) = a(£) induces an isomorphism 
palk) =, Pe. 

(d) Prove that for any object E of € and any subobject A € 
Sub(Z), the closure A can be constructed as the pullback: 


A———> aA 


[ | 


E—— aE. 


6. Let G be a topological group, and let G® be the same group with 
the discrete topology. 


(a) Describe a comonad on Sets such that the category BG? 
of right G®-sets is exactly the category of coalgebras for 
this comonad. 

(b) Describe a comonad on the category BG® such that the 
category of coalgebras is exactly the category BG of con- 
tinuous G-sets; conclude that BG is a topos. 

(c) By “composing”, describe BG directly as a category of 
coalgebras for a comonad on Sets. 


7. Let G be a group object in a topos €. Prove directly that 
U: €© + € has a right adjoint [cf. §7 (8)]. 
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8. 


10. 


11. 


Prove that for a family €, (i € I) of topoi, the product category 
[lier & is again a topos, and that the projection functors 7, : 
II, €& — &; are logical morphisms. 


. [Artin Glueing 4 la Wraith (1974)] Let ¢: € — F be a func- 


tor. Recall that the comma category F/¢ has as objects pairs 
(X,a: Y — @X), where X € €, Y € Ff, and as morphisms pairs 
of morphisms which give commutative squares [CWM, p. 47]. 


(a) Suppose that € and F have finite products, and that ¢ 
preserves them. Show that F/¢ is the category of coalge- 
bras for the comonad on € x ¥, which has the underlying 
functor G: E x F +> € x F, G(X, Y) = (X,Y x 6X). 
Conclude that if € and F are topoi and G is left exact, then 
F/¢@ is again a topos. Prove that the projection F/¢@— E 
is logical. 

Consider the special case were F = Sets and ¢ is the 
global sections functor T = Hom(1, — ): € — Sets. Prove 
that if € = Sh(X) for a topological space X, then Sets/T 
is again a topos of sheaves on a space xX , by giving an 
explicit description of the space Xx. 


(b 


— 


— 
lo) 
— 


(Ultraproducts of Topoi) Let J be a set, and let {&; | « ¢ I} 
be an I-indexed family of topoi, and let 2/ be a filter of subsets 
of f[ (sol eu;U DVeEuUu = UEU;UVEU => 
UNV €U). Define the reduced product [],, €, as follows: the 
objects are the same as those of the product Ther & y&1, 1.€., are 
sequences E = (E,), where E; € €,; the morphisms f: | eee a 
are equivalence classes of sequences f = (f,: E, > F,)eu for 
some U € U, where we identify such an f with the restricted 
sequence (f, | 7 € V), for any V €U with V C U. Use Exercise 8 
and the filter power construction to prove that |], E, is equivalent 
to the filter-quotient [] €;/U/ (for a suitable filter U/ of subobjects 
of 1 in []&;), and hence is a topos. 

(Internal Colimits) Let C be an internal category in a topos €, 
and let A: € — E© be the functor which sends an object 
of E to the trivial left C-object (Co x E,Ci xe, (Co x E) 
C, x E —» E). Prove that A has both a left and a right adjoint. 
[Hint: the left adjoint “lim” sends a C-object (X,7, u) to the 
coequalizer of js and m2: C, xc, X <3. X; this is easy. The right 
adjoint is hard: recall that Cj: € + €/Co has a right adjoint 
IIc,. For a C-object (X,7,), define the required right adjoint 


wv a 
as the equalizer of Ic,(X — Cp) a ©. where the transpose 


IR by 
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of a is the composite C, xq, Ho, (X) aes Co x He, (X) a X, 
€ being the counit of C5 4 Ic¢,; and the transpose of @ is the 


. (™1,do71,72) 1xe 
composite C, x Ig,(X) ————> C, xq, Co x He, (X) —> 


C, XCo X Ear Xx, 


VI 
Topoi and Logic 


Topos theory involves both geometry, especially sheaf theory, and 
logic, especially set theory. This chapter will develop some of the con- 
nections with set theory and illustrate how geometric constructions such 
as sheafification are deeply involved in independence proofs for the ax- 
ioms of set theory. 

After presenting some special properties of topoi which are enjoyed 
by the category of sets, we turn in Section 2 to the famous proof by 
Paul Cohen that the continuum hypothesis (CH) is independent of the 
usual (Zermelo—Freenkel, ZF) axioms of set theory. We shall present a 
perspicuous proof of this independence result, by considering the topos of 
sheaves for the dense topology on the partially ordered set P of Cohen’s 
“forcing conditions”. This proof replaces the sets of the given model of 
ZF by presheaves on the poset P, which has been carefully chosen to help 
force a given big set B between the new (presheaf) natural numbers N 
and its new power set. The resulting topos is an intuitionistic model of 
set theory; it is then sheafified to make it Boolean and finally divided by 
a maximal filter (to make it “two-valued”). A combinatorial condition 
(§3) is needed to insure that this process does not collapse two different 
cardinal numbers. A related and even simpler construction, due to Peter 
Freyd, will subsequently be shown to establish the independence of the 
axiom of choice (AC). 

We present these independence proofs in a way which uses neither 
the formal languages nor the “forcing definitions” from logic. However, 
these ingredients are implicitly there. In Section 5, we will associate with 
each elementary topos a set-theoretic language, its so-called Mitchell- 
Bénabou language. This language strongly supports the intuitive idea 
that topoi are “generalized universes” of sets; to construct objects in 
a topos, one can use not only familiar operations like finite limits and 
exponentials, but also the set-formation process {z | ¢(x)}, where ¢ 
is a formula in the language of a topos. The truth of sentences of the 
language can be described in terms of a convenient “forcing” notation, 
the so-called Kripke-Joyal semantics. This forcing relation for arbitrary 
topoi generalizes Cohen’s forcing technique and explains the connection 
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of our topos-theoretic proof to Cohen’s original proof of the indepen- 
dence of CH. 

In general, the logic governing truth in a topos is not the usual 
(classical) logic, but intuitionistic logic. Thus, an arbitrary topos can be 
viewed as an intuitionistic universe of sets. We shall give one illustration 
of the interesting phenomena that this gives rise to: a topos in which 
every function from the reals to itself is continuous. 

There is an axiomatic presentation of the category of sets—an el- 
ementary topos which is Boolean, well-pointed, satisfies the axiom of 
choice, and has a natural numbers object. This can serve as a founda- 
tion of mathematics, different from the usual foundation by set theory. 
In a final section, we investigate the relation between such a foundation 
and the usual set-theoretic one, and prove that the categorical founda- 
tion is equivalent (more precisely, equiconsistent) with a weak version of 
Zermelo set theory. 


1. The Topos of Sets 


In the first chapters of this book, we have suggested the point of 
view of a topos as a eoneralwed category of sets. For example, we 
considered the topos Sets™ of “sets through time”, the topos Sh(X) of 
sets continuously parametrized by a space X, or the topos Sets©”” of 
sets parametrized by a small category C, etc., etc. 

An arbitrary elementary topos € can (to some extent) be viewed as 
a “universe of sets”: the “sets” are the objects of the topos, on which 
many set-theoretic operations can be performed. For instance, given 
sets (objects of €) X and Y, one can construct the set Y* of functions 
from X to Y and the power set P_X, both as objects of €. In Section 5 
below, we will even show that for set-theoretical formulas ¢(z) one can 
construct in € an object { x | d(x) }. Nonetheless, among all elementary 
topoi the topos of classical sets has many special properties. It is our 
purpose in this section to investigate some of these properties, and see 
which other topoi of presheaves or of sheaves enjoy these properties. 
In many cases, the properties involved resemble familiar axioms of set 
theory. 

The axioms of set theory include an axiom of infinity, to exclude the 
model given by the category of all finite sets. This axiom is ordinarily 
formulated as the existence of some specific infinite set, usually the set 
of natural numbers. 

For an arbitrary topos €, the axiom of infinity states that there is 
an object N (or explicitly, if necessary: Ne) of € with arrows 


1-2,N-4;5N (1) 
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such that for any object X of € with arrows x and f, as below, there is 
a unique arrow h which makes the following diagram commute: 


1_°_ .N—-* 4N 
| f c 2) 


1—,— X —— X. 


f 
The object N is then called a natural numbers object (n.n.o.) for €. For 
such an n.n.o. N, the definition states that the diagram (1) is universal 
among diagrams of the form 1 > X — X. It readily follows that N 
(together with the arrows 0: 1 -» N and s: N — N) is unique up to 
isomorphism. We can thus speak of the natural numbers object of a 
topos €, if there is one. 

In Sets, the usual set of all natural numbers N = {0,1,2,...} has 
the required universal property for an n.n.o. as in (2), where the arrow 
0: 1 — N sends the one element of 1 to 0 € N, while s: N — N is the 
usual successor function n + n+ 1. Given a set X, an element x € X, 
and a function f: X — X, the arrow h uniquely provided by (2) thus 
satisfies 

h(0)=2, — A(n +1) = f(h(n)). (3) 


In other words, A is defined from x and f by recursion, or as one often 
says, “by induction”. This simple form of definition by recursion implies 
most of the other (more general) forms of recursion, as for example 
recursion with a parameter, as used in the recursive definition of addition 
and multiplication for N (cf. Exercise 1). 

Suppose a topos € has a natural numbers object N. Let F be another 
topos such that there are adjoint functors 


FREE FAM, (4) 
g 


with the additional property that g* preserves the terminal object: 
g*(1) = 1. (This is, e.g., the case if g* and g, come from a geomet- 
ric morphism g: F — €, as in the next chapter.) It then easily follows 
that the diagram 


g*(1) 2% g*(n) 2%, g*(N) (5) 


provides a natural numbers object for the topos F: one simply applies 
the (naturality of the) adjunction Homz(g*N, X) & Home(N, g,X). 

This simple observation implies that many of the topoi considered 
before have an n.n.o. For example, if C is a small category then from 
the adjoint pair 


Sets°” —— Sets, A4T, (6) 
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of 81.6, it follows that the presheaf topos Sets©”” has an n.n.o., because 
Sets has one: the n.n.o. of Sets©” is the constant presheaf A(N) with 
value N. 

Moreover, if € is an elementary topos with an n.n.o. N, while 7 is 
a Lawvere-Tierney topology on &, then the topos Sh; € has an n.n.o., 
namely, the sheafification a(N) of N; this follows from the adjoint pair 
inclusion-sheafification ; 

Sh; € ——> € (7) 
considered in §V.3. 

Combining the case of Sets©” and that of a Lawvere-Tierney topol- 
ogy, one concludes that any Grothendieck topos Sh(C, J) has a natu- 
ral numbers object, viz., the associated sheaf aA(N) of the constant 
presheaf A(N): CoP — Sets with value N. Notice that since both 
A: Sets + Sets©” and a: Sets©” — Sh(C, J) preserve arbitrary co- 
products, we have in Sh(C, J) an isomorphism 


aA(N) = [] 1, (8) 


neEeN 


arising from the isomorphism N = ]],,-x 1 in Sets. In other words, the 
n.n.o. in any Grothendieck topos is constructed by taking a countable 
coproduct of copies of the terminal object. 

Another property of the topos of sets is that it is a Boolean topos. In 
general, a topos E is said to be Boolean iff the internal Heyting algebra 
Q—the subobject classifier of £—is an internal Boolean algebra. Here 
are some equivalent conditions: 


Proposition 1. For a topos €, the following conditions are equiva- 
lent: 


(i) E is Boolean; 
(ii) the negation operator =: 2 > 12) satisfies s— = id; 
(iii) for every object E of €, the Heyting algebra Sub(£) is in fact a 
Boolean algebra; 
(iv) for every subobject S— E in E one has 3S V S = E; 
(v) the maps true: 1 > 2 and false = — 0 true: 1 — 1 induce an 
isomorphism 1 +120. 


Proof: Recall from IV.8(13) that the internal Heyting algebra struc- 
ture on 2 corresponds to the external Heyting algebra structure on the 
set Subg(E) of subobjects of any given object E of €, through the iso- 
morphism 

Home(E, 2) = Sube(£), (9) 


natural in E. This Subg(E) is Boolean for each F iff 2 is Boolean; 
i.e., (i) is equivalent to (iii). For any Heyting algebra, the condition 
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a7 = id is equivalent to being Boolean, by Proposition 1.8.4; so (i) is 
equivalent to (ii). Furthermore, (iv) states that complements exist in 
Subg(£), for each E, hence is equivalent to (iii) (cf. Proposition 1.8.2). 
Finally, (v) states that 1+1 is a subobject classifier, with as the universal 
subobject true the first coproduct inclusion true = 7): 1 — 1+1. So 
if this is the case then clearly =— = id, since under the isomorphism 


(true, false): 1 +1 a Q, the map =: 2 — 2 corresponds to the twist 
map tw:1+1-—>1+1, which interchanges the summands, as in the 
diagram 


1+1 (true, false) Q 


| | 


1+1 (ety Q, 

commutative by IV.8(15). On the other hand, (v) is implied by (iv). 
Indeed, the subobjects (true: 1 — 9) and (false: 1 > 2) of 2 have in- 
tersection 0, by the second pullback in IV.8(14), so their join is their co- 
product (cf. Proposition IV.7.6). But by the pullback diagram IV.8(16), 
(false: 1 — Q) = -(true: 1 > 1) in the Heyting algebra Sub¢(Q), so, 
as a special case of (iv), this join is 2, which gives the condition (v). 

This completes the proof of the proposition. 

We now wish to use sheafification to turn a given topos into a 
Boolean one. 

Consider a Lawvere-Tierney topology j on a topos €, with corre- 
sponding natural closure operator S + S on subobjects, as in §V.1. If 
an object F of € is a j-sheaf (that is, F € €;), then by Lemma V.2.4, 
the subobjects of F' in €, are exactly the closed subobjects of F in €, as 
in 


Sube, (F) =ClSube(F) —— Sube(F) (10) 


[here, as in §V.2, ClSube(F) is the set of closed subobjects S C F; 
ie., those S with S = S]. We wish to compare the Heyting algebra 
structures of Sube(F’) and of Sube,(F'). Let us write 0, 1, A, V, >, 7 
for the operations of Sube(F), and 0;, 1,, A,, Vj, =>, 73 for those of 
Subg, (F). 
Lemma 2. For any j-sheaf F, the following identities hold in 
Subg, (F) (for any closed subobjects S and T of F): 
(i) 1,=1, SA; T=SAT, 
(ii) 0; =0, Sv; T=SVT, 
(iii) S >, T=(S=>T), 
(iv) 45 = 78. 
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In other words, this lemma says that finite meets are computed in 
Subg, (F’) as they are in Subg(F), finite sups are computed in Subg, (F) 
by first computing them in Subg(F’) and then taking the closure, etc. 


Proof: (i) The maximal subobject 1 = (F = F) of F is closed, as 
is the intersection S A T of two closed subobjects, so (i) is clear. The 
identity (ii) is clear too, since if 0 is the smallest subobject of F then 
clearly 0 is the smallest closed subobject of F, and S V T is the smallest 
closed subobject containing both S and T. For (iii), we first show that 
S => T is closed. But for any subobject W of F, ifW< S>T 
then WAS < T, hence WAS < T. But WAS = WAS while 
S and T are closed, so WAS < T is equivalent to WAS < T, or 
W <S =>T. In other words, W < S = T implies W < S$ => T for 
any subobject W. Thus S = T is closed. By definition, S >; T is 
the unique closed subobject of F' with the property that for any closed 
subobject R, there is an equivalence R< S >; T iff RA; S > T. But 
S = T is closed, and by Part (i) Aj; = A so S = T has this property. 
Thus (S >; T) = (S = T), as asserted. Finally (iv) follows from (ii), 
(iii), and the identity ,S = (S = 0), valid in any Heyting algebra [in 
particular, in Subg, (F), so that =;5 = S =; (0,)]. 


Theorem 3. In any topos €, the operator —7: 2 — 2 of double 
negation is a Lawvere—Tierney topology, and the resulting category E_. 
of ——-sheaves is a Boolean topos. 


Proof: The three basic properties of negation in a Heyting alge- 
bra (Proposition 1.8.1) at once give the following properties of double 
negation for any subobjects S and T of F in €, 


S < aS; anS = a8, (11) 
a7(S A T) = 13755 A aT. (12) 

This means that 
alae Sub¢g(£) => Sube (E), SwH7s (13) 


is a closure operator. By the basic result about “change-of-base”, 
each arrow f: E — E’ in € gives a morphism of Heyting algebras 
f*: Sub(E’) — Sub(£) (Theorem IV.8.1, external form), so in particu- 
lar a morphism which commutes with -. Therefore the closure operator 
(13) given by —- is natural, and hence —7: 2 — 02 is a Lawvere-Tierney 
topology. 

To prove that the topos €,.. of -—-sheaves is Boolean, we need to 
show that the identity ~—S' = S holds for any subsheaf S of a s—-sheaf 
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E (cf. Proposition 1 above). That S is a subsheaf of E means that S is 
a closed subobject of E, i.e., by assumption 


aS = S, in Sube(£). (14) 


We need to prove that a similar identity i$ = S holds in Subg__(£) 
(but where ~—S now refers to the double negation in €,.!). By Lemma 2 
above, the identity —~S = S in Subg__(£) can be rewritten as an 
identity in Sube(E): =S in €,. is s-(7S) in €, so =7S in €_ is 
a3773745 = (-)§S in €. Therefore —49 = S$ in Sube__(E) is equiv- 
alent to (-)°S = S in Subg(E), and the latter identity holds by (11) 
above, since S is ~—-closed by assumption (14). 

For the case of a presheaf topos, the effect of double negation can be 
stated explicitly as follows. 


Lemma 4. For any subobject A E in Sets©” and any object C 
inC 
(=7A)(C) = {x | « € E(C) and for all f: B — C there 
exists ag: D— B inC withz-f-geA(D)}. (15) 


Proof: For =A, the explicit description is 
(7A)(C) = {x | x € E(C) and for all f: BC, az- f ¢ A(B)}, (16) 


as stated in I.8(19)—or as may be proved directly by showing that 
the subobject described by (16)satisfies the definition of negation. For- 
mula (15)of the lemma follows by applying (16)twice. 


Corollary 5. For any presheaf topos Sets©’” the dense topology 
coincides with the double negation topology. 


Proof: First recall from V.4(6) that for any Grothendieck topology 
J on C, the corresponding natural closure operator on subobject lattices 
can be described directly in terms of J, as follows: if E is a presheaf on 
C and A € E is any subpresheaf, then for all objects C C C and all 
zé E(C) 


zéA(C) iff thesieve {f: B—oC|a-f €¢ A(B)}coversC. (17) 


But recall (III.2.9) that the dense Grothendieck topology J on C has 
the property that a sieve S on C covers in this topology iff for every 
arrow D — C in C there exists an arrow B — D in C such that the 
composite B — C belongs to S. Thus, for the dense topology (17)gives 


zé€ A(C) iff for every f: B > C in C there exists a g: D> B 
in C with z- f-g € A(D). 
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But this is exactly the description of ~—A given in (15) above. 


For this reason, the dense topology is also called the double negation 
topology. 

In III.8(21) we proved that for any sheaf F for the dense topology on 
a small category C the lattice Sub(F’) of subsheaves of F’ is a Boolean 
algebra. This gives a second proof that Sh_.(Sets©’”) is Boolean. 

Another property of the category of sets is that there are only two 
subobjects of the terminal object 1, namely, 1 itself and the empty set 
(= the initial object). In general, a topos € is called two-valued if 0 
and 1 are the only subobjects of the terminal object 1 of €. Because 
of the isomorphism Hom(1,2) = Subg(1), this can be expressed also by 
stating that the subobject classifier 2 of € has only two global sections; 
in other words, there are only two global “truth-values” . 

Although (v) of Proposition 1 might suggest this, being two-valued 
has nothing to do with being Boolean; two-valued means that there 
are only two global truth-values, whereas condition (v) of Proposition 1 
states that from an internal point of view 1+ 1 is the truth-value object. 
For example, if J is a set with at least two elements, then the slice 
category Sets/I is Boolean but not two-valued. 

A different example is the presheaf topos Sets” for a monoid M. 
Here the subobject classifier 2 = Q4,, as explained in §1.4, is just the 
set Naz of all right ideals R in M, with the action of f € M on aright 
ideal R defined by R- f ={heM | fh eR}. Thus an arrow 1 > 2 is 
given by a right ideal R with every R- f = R. But ifsuch an R contains 
any f, R- f is all of M, so there are only two arrows 1 — Q, sending 1 
to ¢ or M. This shows that this topos Sets””” is two-valued. However, 
it is Boolean only if M is a group, as stated in Exercise 2 below. 

An arbitrary Boolean topos € may be turned into a two-valued topos 
by the filter-quotient construction of §V.9. One uses Zorn’s lemma to 
find a maximal proper filter of subobjects of 1, and then applies the 
following proposition (see also Exercises 4 and 6). 


Proposition 6. Let € be a Boolean topos, and let U be a maximal 
filter of subobjects of 1 in €. Then the filter-quotient topos E/U is 
two-valued (and again Boolean). 


Proof: Since the canonical functor E — £€/U is logical, it is clear 
[e.g., from (v) of Proposition 1] that E/U is Boolean if € is. Since U 
is a maximal filter any two subobjects U, V C 1 in E have (see also 
Lemma X.10.1) 


UvVVeEUu ifU CEUorVed. 


In particular, for any subobject U ¢ 1, either U € U or ~U €U. Now 
suppose 1 —» 2 is a morphism in €/U. By the description in §V.9, this 
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morphism can be represented by an arrow 1 x U — 2.x U over U and 
hence by an arrow f: U — © from some subobject U € U, while for any 
smaller U' Cc U with U’ EU, the restriction of f to U’ still represents 
the given morphism 1 — 92. Let V C U be the subobject of U classified 
by fin €. Then V €U or =V EU; hence V EU or (UN-V) EU. But 


t 
f classifies V, so the restriction of f to V is V —~— 1 Balas ©, while 
false 


the restriction of f to (UN =V) is UN ~V —— 1 —— ©. Thus, the 
only morphisms 1 > 2 in €/U are the two represented in € by true and 
false: 1 = 9. This proves that €/U is two-valued. 


A next property which (according to the authors) the category Sets 
enjoys is the axiom of choice. This axiom can be expressed in many 
equivalent ways. The most familiar version asserts that the product 
Thez X; of a collection of nonempty sets X; is again nonempty; or, 
alternatively, that any surjection p: X — I of sets has a section s: I > 
X, 80 ps = 1. [To pass from one version to the other, let X; = p~1(i), 
X= [[X,.] 

An arbitrary topos € is said to satisfy the axiom of choice (AC) if 
every epimorphism X — I in € has a section. From the point of view 
of “the internal logic” of a topos, a weaker property is more relevant, 
however. Say that a topos € satisfies the internal axiom of choice (IAC) 
if for any object E of €, the functor 


(])2E.:€, 


given by exponentiation with E, preserves epimorphisms. Notice that 
if p: X +I is a morphism in € with a section s: J — X, then for any 
E €€ the map p®: X* — T* again has asection s”: JF — X¥. Thus, 
a topos satisfying (AC) also satisfies (IAC). The converse is not true 
(cf. Exercise 5). R. Diaconescu has shown that a topos which satisfies 
(IAC) is necessarily Boolean (Exercise 15). We shall come back to (the 
failure of) IAC in §4 below and to its relation with the internal logic in 
§5 and 86. 

In the category of sets, each function with domain the set A is com- 
pletely determined by its effect upon the elements x of that set A; in 
other words, f # g: A— B implies that there is an element z:1— A 
for which fx # gx. This amounts to asserting that the one-point set 
1 is a generator of Sets; we recall here that a family G of objects of a 
category C is said to generate C iff f 4 g: A > B in C implies that 
fu #4 gu for some arrow u: G — A from an object G in the family G. In 
general, a topos € is called well-pointed if the terminal object 1 generates 
E; to exclude the trivial example for which € is the category with only 
one object and one arrow, we will always assume that a well-pointed 
topos is also nondegenerate; i.e., 0 #1. Notice that a nondegenerate 
topos is well-pointed iff the functor Home(1, —): € — Sets is faithful. 
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There is also a related important property of a topos €, viz., that 
the family of all subobjects of 1 generates €. This property will be 
extensively discussed in Chapter IX. 

One has the following relations between the properties just men- 
tioned. 


Proposition 7. A well-pointed topos E is both two-valued and 
Boolean. 


Proof: First observe that any nonzero object U (i.e., an object U 
not isomorphic to the initial object) in E must have a “global section”, 
that is, an arrow 1 — U. For U has at least two different subobjects, U 
and 0, hence two different characteristic maps U9. Since € is well- 
pointed, there must be an arrow 1 — U distinguishing these two maps 
Un. 

Now suppose m: U — 1 is a subobject of 1 in €. If U # 0, then there 
is a morphism s: 1 — U as we just showed, so ms = 1, msm = m and 
hence U1 must be an isomorphism. Thus € is two-valued. 

Next, we prove that € is Boolean. To this end, take a subobject 
S' — E of an object E of €; we shall show that SV ~S = E [cf. Propo- 
sition 1(iv)]|. Suppose to the contrary that Sv -~S # E. The two 
characteristic maps £31) for the different subobjects S V =S and E 
can be distinguished by a map z: 1 — E since € is well-pointed. In 
other words, there is a map zg: 1 — E which does not factor through 
SV —-S. But consider the pullback 


V+ Sv AS 


ee 


1l—,— E. 


Since € is two-valued, V is either 0 or 1. If V = 1, then z does factor 
through S V —S, contradicting its choice. If V = 0 then, a fortiori, the 
meet in Sub(£) of z: 1 E and SE is 0, so = factors through —S, 
again contradicting the choice of x. Thus, SV ~S = E. 


Proposition 8. Let € be a topos which is generated by subobjects 
of 1, and moreover has the property that for each object E, Sub(E) is a 
complete Boolean algebra. Then E satisfies the axiom of choice. 


Proof: Let p: X — I be an epimorphism in €. By completeness 
of Sub(I), we can apply Zorn’s lemma and find a maximal subobject 
m: MT such that p has a section s: M > X, i.e., ps = m. Suppose 
M #1. Since Sub(/) is Boolean, M has a complement ~M which is 
nonzero because M # I. Hence, since subobjects of 1 generate €, there 
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is anonzero V C 1 in € and amapt: V —~ —~M. Consider the pullback 
xX! > X 


Ve +aM> — f, 


t 


Since p is epi so is p’, and therefore X’ #4 0 since V #4 0. Again since 
subobjects of 1 generate, there is a nonzero object W C 1 and a map 
r: W — XX’. Then p'r: W — V so W CV. Moreover, 


t|W:W tw) cI 


is an isomorphism [epi because t(W) is by definition the image of W, 
mono because W ¢ 1]. But Mmt(W) = 0, so their supremum as a 
subobject of I is their coproduct, and s: M — X and rt7!: t(W) — X' 
patch together to form a section M Ut(W) — X. Since t(W) # 0, this 
contradicts the assumed maximality of M. 


Corollary 9. Let P be a partially ordered set. Then the topos 
Sh(P, ——) of sheaves for the dense topology satisfies the axiom of choice. 


Proof: By Corollary 5, Sh(P, ~—) is Boolean. Being a Grothendieck 
topos, it has complete subobject lattices (cf. Proposition III.8.1). So the 
corollary follows by the preceding result, provided it can be shown that 
Sh(P, +-) is generated by subobjects of 1. Let a: Sets” — Sh(P,--) 
be the usual associated sheaf functor. At the end of §III.6, we observed 
for elements p of P that the associated sheaves ay(p) of representable 
presheaves y(p) = P( — ,p) generate Sh(P,-—). But for any p € P, the 
map P(-—,p) > lin Sets?” is obviously mono. By left-exactness of 
a, the map ay(p) — 1 must then be mono in Sh(P,—-), i.e, ay(p) is 
a subobject of 1 for any p in the poset P, and these sheaves generate 
Sh(P,--). The result thus follows from Proposition 8. 
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Cantor’s diagonal argument shows that the real numbers are not 
denumerable. Now if we write N for the set of natural numbers, the 
real numbers can be identified (at least as far as their cardinality is 
concerned) with subsets of N, i.e., with elements of the power-set PN. 
Thus the diagonal argument shows that there is no bijection N > PN, 
so that the strict inequality N < PN holds for the corresponding cardi- 
nal numbers. 

Cantor’s continuum hypothesis then asserts that there is no (infinite) 
cardinal number between N and PN; in other words, that every infinite 
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set of real numbers is either denumerable or has the same cardinality as 
the set of all reals. Paul Cohen devised the method of forcing so as to 
violate this hypothesis. In a nutshell, his method is as follows. Starting 
with a suitable model S of set theory, take in S some set B larger than 
PN (e.g., the set B = PPN, which is strictly larger than PN, again 
according to the diagonal argument). Then construct a new model S’ 
of set theory in which there is a monomorphism 


g: BPN. (1) 


This will (almost, see below) mean that N < gB < PN holds for the 
corresponding cardinal numbers in S’. Since PN = 2N, one may replace 
the desired function g of (1)by its transpose 


f{:BxN-2, (2) 


where f(b,n) =0 or 1 accordingly as n € g(b) or n ¢ g(b). Thus giving 
f really amounts to giving the graph in B x N of a many-valued function 
from B to N, consisting of all those pairs (b,n) with n € g(b). In order 
that g be a monomorphism one must require that, for b and b’ € B, 


b # Bb’ implies f(b,n) A f(b’, n) for some n. (3) 


In the given model S of set theory there is (according to the diagonal 
argument) no such function f. There are, however, finite approximations 
to f. Such a finite approximation will then consist of a finite subset 
F, C BxN anda function p: F, — 2. One calls such (Fp, p) a condition 
p. In other words, a condition consists of two disjoint lists (b,,n;) and 
(c;,m,;) of elements of Bx N (i=1,...,4,j =1,...,0), with k, 2 finite 
and 

P(h., ni) = 0, P(Cy,™,) =1. 


These conditions constitute a poset P, with partial order defined by 
q<p iff F, 2 F, and q restricted to F, coincides with p. (4) 


Thus if g < p, then g is a closer approximation than p to the function 
f,i-e., g gives more information about f, and one usually says that the 
condition g is an “extension” of p. (In set-theoretic treatments, such a 
poset P is often called a “notion of forcing”, and the elements of P are 
called forcing conditions. The relation to forcing will be explained in 
886, 7 below.) 

Now start with the topos Sets of classical sets, and consider this 
Cohen poset P equipped with the dense topology. The Cohen topos is 
by definition the corresponding category of double negation sheaves 


Sh(P, --). (5) 
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As we saw at the end of the previous section, the Cohen topos is a 
Boolean topos (in which the subobjects of 1 generate) satisfying the 
axiom of choice. 

The purpose of this section and the next is to prove that in the Cohen 
topos Sh(P, —-), there exists an object K together with monomorphisms 


No KON, (6) 


where N is the natural numbers object of Sh(P, =—) and 2 is the subob- 
ject classifier; moreover, in Sh(P, ——) there is no epimorphism N > K, 
nor is there an epimorphisms K — ON. In other words we will prove: 


Theorem 1. There exists a Boolean topos satisfying the axiom of 
choice, in which the continuum hypothesis fails. 


We recall that K. Gédel had used “constructible” sets to provide 
a model of Zermelo-Freenkel (ZF) set theory in which the continuum 
hypothesis holds. 

Before we embark on the proof of Theorem 1, we should make a few 
remarks (some of which are addressed mainly to readers familiar with 
the theory of models of set theory). First of all, the topos-theoretic 
argument that we will present has essentially the same mathematical 
content as the original proof by Cohen that the continuum hypothesis 
is independent of the usual Zermelo—Freenkel axioms of set theory. A 
Boolean Grothendieck topos—such as the Cohen topos Sh(P, ——)—is a 
perfectly good “universe of sets” in which to do classical mathematics, 
but it is not exactly a model of ZF. However, it is not difficult to obtain 
a model of ZF from such a topos, by mimicking the construction of 
the cumulative hierarchy V,, defined for ordinal numbers a as Vo = 
0,...,Va+1 = P(Vq), etc., inside the topos. (This is worked out in 
[Fourman, 1980].) The relation to Cohen’s forcing method will also 
become more apparent in Sections 6 and 7 below, where we will develop 
a “forcing semantics” for an arbitrary topos. In addition, we should 
mention that, in our argument, the category Sets can be replaced by 
any elementary Boolean topos S which has an n.n.o. and satisfies the 
axiom of choice (for example, S can be a countable model of ZFC). It 
follows that the category Sh_.(SP°”) of sheaves for the Lawvere-Tierney 
topology ~— on the elementary topos SP” is a topos (by Theorems V.2.5 
and V.4.1), which is Boolean and satisfies the internal axiom of choice 
(or the external axiom of choice if S is well-pointed), and in which the 
continuum hypothesis fails. Finally, we remark that by using a filter- 
quotient construction, one can obtain a stronger version of Theorem 1: 
there exists a two-valued Boolean topos satisfying the axiom of choice 
but not the continuum hypothesis (see Exercise 7). 
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Let us now start our analysis of the Cohen topos, towards the proof 
of Theorem 1. We shall be mainly concerned with two kinds of sheaves, 
the representable ones and the “constant” ones. 


Lemma 2. For any p in the Cohen poset P, the representable 
presheaf y(p) € Sets? ” is a sheaf for the dense topology. 


The fact that y(p) is a sheaf will be used essentially in §3. 


Proof: Suppose that D is a sieve on q € P, so that d < q for all 
d € D. Suppose also that D covers q in the dense topology; then for 
r<qinP thereisad€ Dwithd<r. Let {zq|d€D} bea matching 
family of elements of y(p), so that zg € y(p)(d) = Homp(d,p). Since P 
is a poset, this matching family shows that 


d<p  foralldeD, (7) 


and to show that an amalgamation in y(p)(q) exists—necessarily unique 
since P is a poset—it clearly suffices to show that g < p. Suppose to 
the contrary that g does not extend p. Then there is a pair (b,n) in the 
domain Fy of p for which either q(b,n) 4 p(b,n) or q(b, n) is undefined. 
Let q’: F, U {(b,n)} — 2 be the condition obtained from q by adding 
a value at (b,n) different from p(b,n), if necessary. Then q’ < q, so 
by density of D there is a d < q’ with d € D. Then d < p since 
d(b,n) # p(b, n), contradicting (7). 

Our next step is the construction in the Cohen topos Sh(P,—7) of 
something like the desired map B x N — 2 of (2); in other words, of 
a subobject of B x N. Actually we first build up a subobject A of the 
constant functor AB x AN = A(B x N) in the category Sets?” of 
presheaves, i.e., a subfunctor A Cc A(B x N). The definition of A comes 
directly from the Cohen poset P: for any p € P, 


A(p) = {(b, 7) | p(b,n) = 0}. (8) 


Thus, A(p) is a subset of B x N, while if g < p in P then A(q) contains 
A(p). So A: P°P — Sets is indeed a subfunctor of the constant functor 
A(B x N). 

Lemma 3. The functor A is a closed subobject of A(B x N) with 
respect to the dense topology; in other words, 


a7A=A_ in Sub(A(B x N)). 
Proof: Choose p € P, b € B, andn EN. As stated in §1 (below 


(17) there), one has (b,n) € —7A(p) iff for all qg < p there exists an 
r <q with (b,n) € A(r), ie, r(b,n) = 0. Now if (b,n) ¢ A(p), then 
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either p(b,n) = 1, in which case r(b,n) = 1 also for any r < p, so 
(b,n) ¢ =—A(p); or else p(b,n) is undefined, and in this case we can 
choose g < p with q(b,n) = 1. Then r(b,n) = 1 for any r < q, so again 
(b,n) ¢ -7A(p). This shows —7A < A. The reverse inclusion is obvious 
(and holds for any closure operator derived from a topology). 


Let us write © for the subobject classifier of Sets’ ’”, and Q_., for 
that of Sh(P, =). Then 2__ is defined to be the equalizer of the identity 
and —7: 232, and Lemma 3 then states that the characteristic map 
AB x AN — 2 of the subobject A factors through 0. say as 


f: ABx AN &©.. (9) 
The transpose of this map f is a map of presheaves 
g: A(B) 3 24), (10) 


Lemma 4. The map g is a monomorphism in Sets’. 


Proof: Since g is a map of presheaves, where monos are tested point- 
wise, it is enough to prove for each condition p that g,: A(B)(p) > 
(2EB))(p) is a mono of sets. To this end, we make the map gp, more 
explicit. First, A(B)(p) is the set B itself, while by the construc- 
tion of the exponential (22M) (p) is the set of natural transformations 
y(p) x A(N) = QD. CO. For be B, 


9p(b): y(p) x AN > Q. 


is the natural transformation given, for g < p and n € A(N)(q) = N, 
according to the definition (8) of A by 


gp(b)(q,n) ={reEeP|r<q, r(b,n)=0}. (10a) 


Now suppose that b and c are different elements of B. Since conditions 
such as p are finite, neither p(b,n9) nor p(c, no) is defined for no € N 
chosen sufficiently large. Hence one can construct a condition r < p with 
r(b,no) = 0 and r(c,no) = 1. Then r € gp(b)(p, 0) but r ¢ gp(c)(p, No). 
So gp(b) # gp(c), and the lemma is proved. 

Recall that the inclusion of sheaves into presheaves has a left adjoint, 
the associated sheaf functor a, as in 


Sets —*— Sets?” —*- Sh(P,—\). (11) 


In this section and the next, we will write 9 for the sheaf aA(S) corre- 
sponding to a set S. 
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Corollary 5. The associated sheaf functor sends the map g of (10) 

to a monomorphism mn 
m: B= ON 
in the Cohen topos. 

Proof: The associated sheaf functor a is left exact, hence preserves 
monos; so it sends the presheaf monomorphism g to a monomorphism 
of sheaves 

m = a(g): aA(B) > a(M4®)), 
By definition, aAB = B and aAN = N, so it remains to show that 
a(M%)) ~ 924%) But for an arbitrary presheaf X, there are natural 


bijections, where Hom denotes maps of presheaves and Homs, maps of 
sheaves, 


Hom(X,94%)) = Hom(A(N) x X, 2...) 
(since 2. is a sheaf:) = Homs,(a(A(N) x X), 2.) 
(a is left exact:) = Homsn(aA(N) x aX, 2.) 
~ Homsi (aX, 224)) 
(224) is a sheaf:) = Hom(X,224%)). 


Then applying the Yoneda lemma in both directions gives maps which 
by naturality show that the sheaves OS) and OQN_ are isomorphic. 
[This argument of course applies more generally, cf. Exercise 5(c) of 
Chapter V.] 

The natural numbers object N of Sets gives a new natural num- 
bers object N of the Cohen topos Sh(P,-=7), while O., ~1+1= 3 
[cf. Proposition 1.1(v)] is the subobject classifier of Sh(P,--). Thus, 
aN x & P(N) is the “power-set” of the natural numbers in Sh(P,--). So 
our construction using the poset P and its corresponding presheaf A has 
“forced” the given large set B—or more exactly, the corresponding sheaf 
B—to be inside the new power-set ON by way of the monomorphism 
m just described. Since also N — B in Sets, we thus have 


NB P(N) —3N (12) 


in Sh(P,-—). However, for all we know now the inclusions in (12)may 
not give strict inequalities of cardinal numbers: perhaps B has become 
countable in Sh(P, 7), or of the same cardinality as P(N)! 

To prove that the continuum hypothesis fails in Sh(P,—=-—) we will 
now prove that the strict cardinal inequalities N < 2N < B in Sets will 
give strict cardinal inequalities 


N<2N<B (13) 
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between the corresponding sheaves in the Cohen topos. This will occupy 
the next section. Once this has been achieved, the proof of Theorem 1 
will be complete, since from (12)and (13)we obtain strict cardinal in- 
equalities 


N <2N <QN (14) 
in Sh(P,——). The sheaf B has disappeared from the result (14); its only 
task was to force the “real power-set” 2N in Sh(P, =) to become very 
large—much larger than the “fake power-set” 2N coming from the power- 
set 2N of the topos Sets. It is this fake power-set and not necessarily B 
which violates the continuum hypothesis. 

In set theory, this method of Cohen has been used to establish many 
other independence results (for example, for the Souslin hypothesis on 
the real line, or to find a model of set theory in which every subset 
of the reals is Lebesgue measurable). In each such case, one uses a 
partially ordered set P of conditions chosen to fit the circumstances; 
these posets P are called “notions of forcing”. The usual presentations 
have a somewhat different intuitive description, in that the role of the 
ultrafilter (in our case, used for the filter quotient) is emphasized; thus, 
for the Cohen P above, one constructs a generic filter G C P so that 
each pair (b,) is in the domain of at least one p in G and so that any 
two b,, be with b 4 be are separated by some p, with p(b,,n) 4 p(be, n) 
for some n. The union of these (b,n) € G then essentially provides the 
desired f*: B x N — 2. In this view, the role of the “generic” set is 
emphasized; one speaks of a model of set theory generated by this generic 
set. This view does not appear so strongly in our presentation, which 
is more directly motivated by an alternative formulation in terms of 
“Boolean-valued” models of set theory, discovered by Scott and Solovay, 
not presented by them, but explained for example in a book by J. L. Bell. 
(A Boolean-valued model is one in which the validity of set-theoretic 
formulas is given by truth values lying in some large Boolean algebra— 
in our presentation, the Boolean algebra 2...) 

Nevertheless, it is our clear understanding that the ultimate mathe- 
matical content of all these methods (generic sets, Boolean-valued mod- 
els, and double-negation sheaves) is essentially the same. Indeed, a 
reading of the original paper by Paul Cohen clearly reveals the role 
there of double-negation. And sheafification has a wraith-like presence 
in Cohen’s paper. 

Perhaps a full understanding makes use of all three approaches— 
generic sets, sheaves, Boolean-valued models! 

Let us summarize the sheaf-theoretic argument. In a given category 
of sets we have the set N of natural numbers, its power set 2N and 
a still larger set B, so there is no mono g: B — 2N. Introduce the 
poset P whose elements are “finite states of knowledge” about such a 


284 VI. Topoi and Logic 


(nonexisting) mono g. Thus each p € P specifies for each of a finite 
number of elements b € B a finite subset gp(b) C N and a disjoint finite 
subset gi(b), consisting of those elements already “known” not to be 
in g(b). A subsequent state g < p in the partial order P is one with 
(perhaps) larger sets g,, gi. Now treat these states of knowledge as if 
they were open sets in some “space” and specify that a sieve S on g 
will “cover” g in the corresponding (dense) topology iff for all r < gq, 
there is s < r with s € S. Among the functors H: P — Sets we 
distinguish those which are sheaves for these coverings. In particular, 
each p determines such a functor y(p) = Hom(-—,p) which is a sheaf, 
as proved from density by the way we can construct “conditions” s < gq. 
Then we shift from sets B to presheaves AB to sheaves B, as in 


(B,N,2) + (AB, AN, 9) 4 (B,N, Q_.), 


where the new truth values form the sheaf 2... with each Q..(p) the set 
of all covering sieves on p. By the definition of the poset P (so defined 
for this very purpose) we can “mimic” the desired mono g by a map of 
presheaves 


gp: (AB)(p) — NAN (p) = Nat(y(p) x AN,Q) 


in accord with the definition of the exponential objects in any functor 
category. Moreover, the description of this g, is such that the values 
of these natural transformations actually lie in the subobject classifier 
for sheaves. But the process of sheafification is left exact, so preserves 
monomorphisms and thus produces a mono, 

g: B. ON 
as stated in Corollary 3. This does indeed give the desired result, because 
y(p) is a sheaf and because, Sh(P) being Boolean, the passage S + S 
from sets to sheaves actually preserves cardinalities (proper monos stay 
proper)—as follows in the next section from a special countable chain 
condition (the Souslin property) of the Cohen poset P. 

In brief, sheaves for the dense topology on the poset of finite states 
of knowledge about the desired impossible monomorphism form the new 
model of sets in which that mono is really there. 


3. The Preservation of Cardinal Inequalities 

In this section we will define, for any two objects X and Y in a topos 
E, an object Epi(X, Y) C Y%*, called the “object of epimorphisms” from 
X to Y. This object has the property that Epi(X,Y) = 0 implies that 
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there is no epimorphism X — Y. Then we define for objects X and Y 
of €, 


X <Y iff there is a monomorphism X — Y, and Epi(X,Y) = 0. 
(1) 
Thus in the topos of sets, X < Y implies that the cardinality of X is 
strictly less than that of Y. For the Cohen topos Sh(P, ——), we will also 


prove for infinite sets S and T with corresponding sheaves S andT (as 
in §2) that 


Epi(S,T) & 0 (in Sets) implies Epi(S,T) 0. (2) 

Since the functor = aA: Sets — Sh(P,—-), which sends a set S to the 
sheaf S, is left exact, and hence preserves monos, this also means that 

S<T implies $< T. (3) 


Now in Sets we had the object N of natural numbers and we had also 
chosen a large set. B with 


N<2N<B (in Sets). (4) 
By (3) above and (12) of §2, this gives 
N<2N <B P(N) (5) 
in the Cohen topos, where m is the monomorphism of Corollary 2.5. It 
follows by Lemma 5 with m: Z — Y below that 


N <2N < P(N), (6) 


which is exactly what was needed to complete the proof of Theorem 2.1. 

Let us fix objects X and Y in a topos € and define the object 
Epi(X,Y) of epimorphisms from X to Y. For a “parameter object” 
E of E, we first define an operation 


img: E(E,Y*) > E(E,Q”), (7) 


as follows. Given f: E > Y%, let f: Ex X —Y be its transpose, and 
let Img(f) be the image of the map (m, f): E x X > Ex Y, as in the 
diagram 


ExX (mF) »ExY 


SS rs (8) 


Imz(f). 


Thus, Img(f) is a subobject of ExY, and we define img(f): E> OY to 
be the transpose of the characteristic map E x Y — Q of this subobject 


Imz(f). 
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Lemma 1. The map img: €(E, Y*) > €(E,Q”) is natural in E. 

Proof: Let a: E’ — E be an arbitrary map of €. We need to show 
that for an f: E — Y%, the identity img (f oa) = img(f) oa holds. 
Now fa = f(a x 1): E’ x X — Y; thus, the diagram 


Bix xX 4S BX 
crf I Sy (9) 


E’xY — ExyY 
axY 
is easily seen to be a pullback. Therefore Img(fa) is the pullback of 
Img(f) along a x Y, because epi-mono factorizations are stable under 
pullback, cf. Theorem IV.7.2. But, as indicated in the diagram below, 
pulling back subobjects along a x Y: E’ x Y — Ex Y corresponds to 
composition of characteristic maps with ax Y, and the latter corresponds 
to composition with a@ under the exponential adjunction: 


Sube(E x Y) = Home(E x Y,Q) = Hom,(E, 2") 
ory} io Ie (10) 
Subg(E’ x Y) = Home(E’ x Y,Q) = Hom,(E’,9"). 


This proves the lemma. 
By the Yoneda lemma, we conclude that the above natural transfor- 
mation 
img: E(E,Y*) > €(E, 0”) 
is induced via composition by a uniquely determined map 


ime ¥* = 0", (11) 


t 
Now let ty: 1 > QY be the transpose of 1 x Y —> 1 = , and 
define Epi(X, Y) as the pullback of ty along im, 


Epi(X, Y) ———— 1 


|» (12) 


y* ——__. 0", 
Intuitively, Epi(X, Y) has thus been constructed as the object of those 
functions from X to Y whose image is all of Y. (There is also a more 
“elementary” description, which doesn’t use Hom-sets, cf. Exercise 8.) 
More formally, Epi(X, Y) “classifies” parameterized epimorphisms in the 
following sense: 
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Lemma 2. For any object E of €, a morphism f: E + Y~ factors 
through the subobject Epi(X,Y)—Y~ iff (m, f): Ex X ~ EXY is 
an epi in €. 


Proof: By the pullback (12), such a map f factors through 


Epi(X,Y) iff imof = img(f) is the map E —> 1 —> QY. Taking 


the exponential transpose, this means by the definition of img that the 
t 
characteristic map of Img(f)— Ex Y is the map Ex Y —> 1 AEG 


This, by (8)in turn, means that (m, f): Ex X — Ex Y is epi. 
This result includes the desired property of Epi: 


Corollary 3. In a nondegenerate topos, Epi(X, Y) = 0 implies that 
there is no epimorphism X — Y. 


Proof: Any g: X — Y can be expressed, via X = 1x X, asa 
transpose g = f of some f: 1 > Y%. If g is epi, the lemma with E=1 
shows that f must factor through Epi(X,Y) by some map £ = 1 > 
Epi(X,Y). But Epi(X, Y) = 0, while by Proposition IV.7.4 any arrow 
to 0 in a topos is an isomorphism, hence 1 = 0. But 1 $ 0, since the 
topos is not degenerate. 


One has to be somewhat careful with the functorality of Epi(X, Y). 
However, the following two lemmas suffice to deduce (6) from (5) above. 


Lemma 4. Let p: Y — Z be an epimorphism in €. Then the 
induced map p* : Y* — Z* restricts to a map Epi(X,Y) — Epi(X, Z). 


Proof: By the natural correspondence expressed by Lemma 2, it 
suffices to prove for an arbitrary arrow f: E — Y™* in € that if 
(m, f): ExX — ExyY isepi, then so is (™, (p* of)): ExX S ExZ. 


But (7, (p* o f)) = (Ex p)o(m, f); so this follows from the fact that 
the product E x p is epi if p is, as one may prove by exponential trans- 
position (cf. Proposition IV.7.7). 


Lemma 5. In a Boolean topos, let X be an object, m: Z => Y a 
mono and zg: 1 > Z a global section of Z. If Epi(X, Z) = 0 then also 
Epi(xX, Y) = 0. 


Proof: Since € is Boolean we may write Y = Z+Z', where Z’ = nZ 
is the complement of Z in the Boolean algebra of subobjects of Y. Then 
id: Z > Z and Z'’ > 1 > Z patch together on the coproduct to a map 
r:Y=Z+2Z' > Z with rm = id. Thus r is epi, and by Lemma 4, r 
induces a map Epi(X,Y) — Epi(X, Z), so the result follows since in a 
topos any arrow to 0 is an isomorphism (Proposition IV.7.4). 


In particular, Lemma 5 implies that from Epi(2N, B) x ~ 0 it follows 
that Epi(2N, 2 5N) & = 0, so (6)indeed follows from (5). To complete the 
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proof of Theorem 2.1, it thus remains to show that (2) holds. This uses 
a special combinatorial property of the Cohen poset P. 

Recall that a topological space T is said to have the Souslin property 
if any family of pairwise disjoint (nonempty) open subsets is at most 
countable. Similarly, for an object X of a topos €, we say that X 
has the Souslin property if any family A of subobjects of X which are 
pairwise disjoint, i.e., for U, V € A, 


UAV=0 in Sub(X) whenever U 4 V, 


is at most countable. [One also says that the Heyting algebra Sub(X) 
satisfies the countable chain condition.] A Grothendieck topos E is said 
to have the Souslin property if it is generated (cf. §1) by objects having 
the Souslin property. 

For a Grothendieck topos €, we write S S for the functor Sets > 
E which is left adjoint to the global sections functor, as discussed earlier 
in this chapter at (2.11). So if (C, J) is a site for €, ie., if E ¥ Sh(C, J), 
then S is the associated sheaf of the constant presheaf AS: C°P — Sets 
with value S. 


Proposition 6. Let € be a Grothendieck topos. If€ has the Souslin 
property, then for any two infinite sets S and T, Epi(S,T) = 0 in Sets 
implies Epi(S, T) = SO0in€. 


Proof: Suppose to the contrary that Epi(S, T) 0 for such infinite 
S and T. This means that Epi(S, T) has at least two subobjects, hence 
at least two different arrows to 2. By our assumption on the generators 
of €, this means that there is a nonzero object X of € with the Souslin 
property and for which there is a morphism f: X — Epi(S,T). By 
Lemma 2, f corresponds to an epimorphism 


g=(m,f):XxSoXxTF 


over X in €. For two elements sEeS and ¢ € T, consider the corre- 
sponding arrows 3: 1 > Sand f#:1—3T in €, and use these arrows and 
g to construct the following pullback diagram i in E: 


Us, P, —_"» » X x12 X 


| | : 


x8 


Let W be the set {(s,t) € Sx T | Us4 #0}. We claim that for any 
given t € T, there is at least one s € S' such that (s, t) € W. To see this, 
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note first that S ¥ [[s € S1 in Sets, so Sx IIs € $1 in €, and hence, 
since X x (—) preserves coproducts [being left adjoint to exponentiation 
(—)*], there is an isomorphism 


[ [ex p2xxS. 
ses 


Since pulling back along P, —~ X x g preserves coproducts (by Theo- 
rem IV.7.2), this yields 
[ | Use & Pe. 


ses 


Furthermore, the map h: P, — X is epi since g is (by Proposi- 
tion IV.7.3), so P, is nonzero since X is. Since P; is the coproduct 
of the U,,, as just shown, at least one of the UV, must be nonzero. This 
proves that for any given t € T there is at least one s € S with U,. 4 0, 
as claimed above. In other words, 72: W — T is a surjection of sets. 

On the other hand, the vertical arrows U,;— X in the preceding 
diagram represent each U,, as a subobject of X, and for distinct ¢ and 
t’ one has U,. AUs,7 = 0. Indeed the square 


0 ———> 1 


| | 


1—,-T 


is a pullback in sets if t 4 t’; therefore since both functors’: Sets > € 
and X x (—): € > E preserve pullbacks as well as colimits (and hence 
preserve zero), the subobjects 1 x #: X x 1— X x Tand1x?:Xx1l—3 
X xT are disjoint. But then so are their pullbacks U,, and U,v. 

Since X has the Souslin property, it follows that for any given s 
the set W, = {t € T | (s,t) € W} is at most countable. Now S is 
infinite, so this implies that the cardinality of the set W = U,<g Ws 
equals the cardinality of S. Thus, there is a bijection of sets S = W, 
which when composed with 72: W - T gives a surjection of sets S »T, 
contradicting the hypothesis. 


The following result now completes the argument. 
Lemma 7. The Cohen topos has the Souslin property. 


Proof: The representable objects y(p) for p € P generate the Cohen 
topos Sh(P, ——), by Lemma 2.2. These are all subobjects of 1, so since 
the Souslin property is obviously inherited by subobjects, it suffices to 
show that 1 € Sh(P,—7) has the Souslin property. To this end, let 
{U; | i € I} bea family of nonzero subobjects of 1 such that U; nU; = 0 
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whenever i # j. We have to show that this family is countable. Since 
the objects of the form y(p) generate, we can pick for each index i some 
pi € P such that y(p;) < Uj. Then y(p;)Ay(p;) < UiNU; = Oifi ¥ J; so 
there is no r € P with r < p; andr < p,, i.e., p; and p; are incompatible 
conditions in P. (A set A C P consists of incompatible conditions if 
p#q in A implies that there is no r in P with r < p and r < qg.) Thus, 
Lemma 7 will follow when we prove the following fundamental property 
of P. 


Lemma 8. On the Cohen poset P, any set of incompatible condi- 
tions is necessarily countable. 


(One also says that P satisfies the “countable chain condition” be- 
cause it means that any “antichain” in P is countable!) 


Proof: Given a set A of incompatible conditions, let A, be the set 
of those conditions p in A for which the domain F, of p consists of 
exactly n elements. Then it clearly suffices to show that each A, is a 
countable set. This will be done by induction on n. For n = 0 there is 
nothing to prove. So suppose we have proved that any set of mutually 
incompatible conditions, each of n — 1 elements, is a countable set. To 
prove that A, must thus be countable also, write An =U, Anm, where 
Anm = {p € An | 3b € B such that p(b,m) is defined}. It is then 
enough to show that each An,m is countable. Pick for each p € An,m a 
bp € B such that p(b,, m) is defined, and write, for i = 0,1, 


An,m,i = {p € An | p(bp, m) =1 }. 


Since for any m and i, the elements of Ap,m,; are pairwise incompatible, 
so is the set of their restrictions, written p|..., 


Rami = {P| Fp — {(bp,m)} sp © Anmy }- 


This is a set of conditions on n — 1 elements, hence a countable set, by 
the induction hypothesis. Therefore Ay; is countable, and hence so is 
An,m = An,m,o U Anjm,i- This completes the proof. 


The argument in this section can also be profitably understood in 
reverse order. First, by the above combinatorial argument, the Cohen 
poset satisfies the countable chain condition (CCC). The elements p of 
this poset provide generators y(p) = Hom(—,p) for the presheaf and 
sheaf categories, and thereby suffice to prove that the latter category has 
the Souslin property, a direct reflection of CCC on subobject lattices. 
In turn, this property, viewed as a restriction of the “size” of P, suffices 
to establish the required preservation of cardinal inequalities. 
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4. The Axiom of Choice 


In this section we will construct another topos of double-negation 
sheaves 


F =Sh..(A) (1) 


which shows that the axiom of choice is independent of the other axioms 
of set theory. The site A for this construction was found by Peter Freyd; 
it gives the following result. 


Theorem 1. There exists a two-valued Boolean Grothendieck topos 


F with a natural numbers object N which has a sequence of objects 
Fo, A, Fy, ... such that 


(i) for each natural number n, the unique map F,, — 1 is epi, 
(ii) the product [],,, Fim exists and is the initial object 0, 
(iii) each F,, is a subobject of P(N). 


Observe first that the topos ¥ of (1), like any Grothendieck topos, 
indeed has a natural numbers object N. As noted in §1, this object 
can be constructed as the countable coproduct of copies of the terminal 


object: 
N=[]1. (2) 
nen 


The objects F,, of F are nonempty by condition (i) of the Theorem, 
but have zero product by condition (ii). This violates a familiar version 
of the external axiom of choice (AC). Now it is relatively easy to find a 
Boolean topos € in which AC fails, but much harder to find such a topos 
in which the internal axiom of choice (IAC) fails. But it is this axiom 
IAC which we wish to violate, to demonstrate the “independence” of 
the axiom of choice, since IAC for a topos F expresses the fact that the 
axiom of choice holds “in the internal logic” of the topos F, as will be 
explained in §6 below. 

The infinite product [],,, Fm used in Theorem 1 is to be constructed 
as follows: The unique maps F, — 1, one for each natural number m, 
combine, by the definition of the coproduct, to give a map 


Dp: [] m-JJ[1i2N (3) 


meEN nen 


to the natural numbers object N of (2). Now construct the pullback P 
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in 


where id is the transpose of the identity map N —N. Then for any 
object X of F, the universal property of this pullback square implies that 
there is a natural bijective correspondence between maps f: X — P in 
F and maps (transposes of kf) 


giNxXa [| ™ (5) 
neN 


such that pog=™m:NxX ON. But 


Nx x=([[)xx2]Jaxx2]]% 


neNn n 


so g is given simply by a sequence of maps gn: X — [],, Fn, while the 
identity po g = ™ above implies that g,, sends X into the summand F;,, 
of this coproduct. Applied with f: X — P replaced by the identity map 
1: P — P, this gives a sequence h,,: P — F,,, and by naturality h,f = 
9n- In short, a map f: X — P or g as in (5) is uniquely determined 
by a sequence of maps gn: X — F,, with hn f = gn. Therefore P is the 
infinite product [],, F, with the maps h,,: P — F, as the projections. 
Now by (i) of the theorem-—-which of course still has to be proved— 
each map F,, > 1 is epi. Therefore so is the map p: [],, Fm > L],1= 
N, as a coproduct of epis. Thus (ii) of the theorem, when proved, 
implies that the map pN occurring in the diagram (4) cannot be epi, 
since its pullback along id is 0 = P — 1, which of course is not epi. In 


other words, the functor (_)N does not preserve epis, and therefore, the 
internal axiom of choice fails in F. 

To transfer this result to the usual set-theoretic axioms, recall that 
the discussion in §2 of the continuum hypothesis mentioned that any 
Boolean Grothendieck topos F gives a model of Zermelo—Freenkel set 
theory, constructed by mimicking within F the standard formulation 
of the cumulative hierarchy. In this connection, it is important that 
the objects F, used above are “small”, as expressed by part (iii) of 
the theorem, so that they will lie inside the cumulative hierarchy so 
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constructed. (Incidentally, in the case at hand, the objects F,, for n > 0 
generate the whole topos F, so it follows that any object of F lies within 
this cumulative hierarchy.) 

We will find the following description useful. 


Lemma 2. In a presheaf category Sets©’, a subobject AC is 
dense for the ——-topology iff A meets every nonzero subobject B of C 
(in other words, iff B # 0 implies BN A # 0). 

More briefly: Dense means meets everything which is nonzero. 


Proof: Suppose first that the subobject A is dense for the ——- 
topology; i.e., that —7~A = C. Every subobject B of C’ then has 
B c —7A; hence, by the definition of negation, BN =A = 0. Since 
B #0 one cannot then have B c —A, hence, by the definition of nega- 
tion again, BN A# 0. 

For the converse, recall that AM—A = 0 in any Heyting algebra. 
Thus, if a subobject A meets every nonzero subobject, it follows that 
aA must be 0 and hence that ==A = C, so A is indeed —- dense. 


This proof also applies to any Heyting algebra, to show that sr = 1 
iff y # 0 implies y Az #0. 

We now prove Theorem 1. As a site for this topos F of (1), take the 
category A with objects all the finite sets of the form 


n= {0,1,...,n}, 


while a morphism f: n — m in A is any function from {0,1,...,n} to 
{0,1,...,m} with n > m and f(i) =7 for all i < m. In other words, 
A is the category of all nonzero finite (von Neumann) ordinals and the 
retractions of one ordinal to another smaller one. The following two 
simple properties of the category A will be crucial: 


(A) Homa(n,m) 4 0 iff n > m; 
(B) if two maps f, g: n > m of A fit into a commuting square of A 


then f = g. 


Now let Hn = Homa(—,7) be the representable functor given by 
n, and consider its sheafification 


FF, = a(Hn) = a(Homa ( om) n)), 
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where a is the usual sheafification functor 
a: Sets*” — Sh__(A) = F. 


We will prove that the sheaf category F is two-valued and that this se- 
quence F,, of objects in F satisfies the conditions (i)-(iii) of the theorem. 

To state that F is two-valued means that the only subobjects of 1 in 
F are the inevitable subjects 0 and 1. Indeed, in the presheaf category 
Sets*”, a subfunctor of 1 which is nonzero (and so equal to 1) at some 
integer n must also be nonzero at each m > n, because there is in A an 
arrow from m to n. Hence the subfunctors of 1 in presheaves are the 
empty functor 0 and the functors U, (n = 0,1,...) given by U,(m) =0 
ifm <n and U,(m) = 1 if m > n. Now since U,N Um D Umin, every 
Un meets every other U,,; ie., every Un, is dense. But by V.2.4, the 
only dense subobject of 1 which is a sheaf is 1 itself, so that among the 
U,,, the presheaf 1 = Up is the only sheaf. Thus 0 and 1 are the only 
subobjects of 1 in ¥, and F is indeed 2-valued. 

For the sheaf determined by a set S we adopt the notation of the 
previous sections and so write S + S for the composition of the functors 


A AP @ 
Sets — Sets” -— > Sh..(A) =F. 


Since sheafification is left exact and a left adjoint it preserves terminal 
objects and coproducts; hence 


T=1 2=141, (6) 


where 2 = {0,1} is a two-element set. Moreover, since ¥ is Boolean, 
its subobject classifier © is 1+1 (see Proposition 1.1), so, writing N for 
the natural numbers object of F [cf. (2)], we have 


P(N) = = J] 2. (7) 
neN 


Lemma 3. The subobject classifier 2 = 2 is an injective presheaf. 


Proof: Recall from (8) in §V.2 that for any topos € and any topol- 
ogy j on € the object 0, is a retract of the subobject classifier 2 for €, 
hence is injective in € since 2 is (§IV.10). The statement of the lemma 
is a special case, when € = Sets*” and j is the =—-topology. 


n~ 


Next we show that there is for each m a monomorphism F,, — P(N), 
as required for (iii) of the theorem. So consider any two distinct maps 
f#g9:n—™M in A and the induced maps 


fe =Hom(—,f), 92 = Hom(—,g): Hn > Hm = Hom(—,m). 
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Since f 4 g, property (B) above of the category A states that the images 
f» and g, are disjoint subfunctors of H,,. Each of them has a map to the 
terminal object. Since they are disjoint, their coproduct is their join, by 
Proposition IV.7.6, so this gives a map 


Im f, UIm 9s 3141=2. 


But 2 is an injective presheaf, so the map on this subobject extends to a 
map trg: Hm > 2: by its construction, this map gives different images 
for the two elements f and g of H,, = Hom(—,m). Now for any fixed n 
and m there are only a finite number of such pairs (f, g) with f # g, and 
there are only a denumerable number of finite ordinals n. Therefore, all 
these maps t+, combine to give a monomorphism 


of presheaves. But the product []2 is a sheaf, so the 7—-closure of 
this subobject H,, is its sheafification Fy, C P(N). Thus we have 
monomorphisms a 

Hy, Fm — P(N) 
where the first is dense, as required for condition (iii) of the theorem. 

Part (i) of the theorem now follows easily. Since Hom(—,n) = 
H,, > Fp, is a mono and Hy, is not empty, Fy, is not the initial object 0 
of F. Thus, if we factor F,, — 1 as an epi F, > V,, followed by a mono 
Vn 1, the image V,, cannot be 0. Since F is two-valued, this image 
must then be the terminal object 1 of F; in other words F;,, — 1 is epi, 
as required for Part (i) of the theorem. 

Finally, we prove Part (ii), that the infinite product P = [],, Fm 
constructed in (4) is 0. If not, there is some n with P(n) # 0, and 
the projection P — F,, then implies that F,,(n) #4 0 for all m; we 
will show to the contrary that Fnii(n) = 0 for all n. Otherwise, there 
would be an element of Fn41(m) and hence by the Yoneda lemma a 
natural transformation u: Hom(—,n) — Fy41. Now pull back the 
dense inclusion (sheafification) 7: Hom(—,n+1) — Fri along u to 
obtain a dense subobject @ of Hom(—,7), as in the diagram 


Q —*—— Hom( ~ ,n +1) 


| ; 


Hom(— ,2) ——yr— Frnt - 
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Here Q # 0, since by Lemma 2 the zero-object can never be dense. 

Because Q # 0 there is some natural number m with Q(m) C 
Hom(m,n) nonzero; choose in Q(m) such a g: m — n and let its im- 
age u’g in (8) beh: m > n+1. Therefore, m > n+ 1. Now by the 
description of the category A there are maps f, f/: m+1l—7minA 
sending m +1 ton+1 and to g(n+1) <n, respectively. It follows that 
go f=gof'. Therefore, 


hf=uw'(g)of (by definition of h), 


= u'(gf) (by the naturality of u’), 
=u'(gf’) 

=u'(g)of’ (again by naturality), 
=hf' (by definition of h). 


But Af(m+1) = h(n +1) =n+1 since n+1< m, while hf’(m+1) = 
h(g(n +1)) = g(n +1) <n. Thus hf # Af’, a contradiction. 

This shows that there are no maps Hom(— ,n) > Fr4+1, as asserted. 
The proof of Theorem 1 is now complete. 


5. The Mitchell-Bénabou Language 


Ordinary mathematical statements and theorems can be formu- 
lated with precision in the symbolism of the standard first-order logic. 
This symbolism starts with constants 0,1,2,...,a,b,c,... and variables 
£L,Y,2,-.. combined by the appropriate primitive operations so as to 
give terms such as x? or y + z. These terms enter into primitive re- 
lations <,=,... to yield formulas such as x < y or r+ y = z; then 
formulas are combined by propositional connectives (“and” , “or”, “not” 
and “implies”, written as A, V, =, =) and by quantifiers (Vz, for all 
and dz, there exists an x) to yield more complicated formulas. In the 
language of real numbers (or of natural numbers) such a formula might 
be 

(Vz)((Sy)(2 < y)A(a@<O0Va>0)). 


Alternatively, there are formulas in other languages such as the language 
of elementary geometry or that of set theory. It is by no means necessary 
or usual to explicitly state all theorems of interest in such a formal way. 
But there are at least four objectives for occasional such formulations, 
as follows: 


(1) They provide a precise way of stating theorems. 
(2) They allow for a meticulous formulation of the rules of proof of 
that domain, by stating all the “rules of inference” which allow 
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in succession the deduction of (true) theorems from the axioms 
for the domain. 

(3) They may serve to describe an object of the domain—a set, an 
integer, a real number—as the “set of all so and so’s” thus, in the 
language of natural numbers, {x | x? > 2} is a description of the 
upper half of a well known Dedekind cut. 

(4) They make possible a “semantics” which provides a description 
of when a formula is “true” (that is, universally valid). Such a se- 
mantics may specify what the formula “means” in terms of some 
domain of objects assumed to be at hand. Thus the formulas of 
Boolean algebra have an interpretation in terms of the subsets 
A, B,C,... of some assumed universe; the rules of the seman- 
tics involved will state, for example, that « € (AN B) iffx € 
Aand x € B, and so on. For Heyting algebras there is a corre- 
sponding semantics in terms of open subsets of a given topological 
space. 


These various purposes are interrelated. For example, the rules of 
inference in (2) are said to be “sound” when they yield semantically 
valid theorems from valid axioms; they are “complete” when they yield 
all such theorems. These notions are essential for basic results such as 
the famous Gédel completeness and incompleteness theorems. 

In such languages it is a common custom to use different lists of 
letters for variable elements of different sorts or types. Thus one uses 
m,n,... for elements of the the set N of natural numbers, z and y for real 
variables, z and w for complex variables, f and g for functions, and so on. 
For a topos E€ we will follow much the same procedure, by regarding the 
objects X,Y,... of € as the “sorts” or “types” and introducing a stock of 
variables for each type. We thus propose to describe a “language” (called 
the Mitchell-Bénabou language) for €; at the end of this section we will 
give a description of validity for the formulas of this language [point 
(4) above]. As for point (2), we will observe that the rules of inference 
appropriate to a general topos are precisely the standard rules for the 
first-order intuitionistic predicate calculus. This striking observation 
shows that these rules are supported by the geometrical aspects of sheaf 
topoi. 

Finally, as in point (3), we will show that formulas (2) in a variable 
zx of the Mitchell-Bénabou language can be used to specify objects of € 
by expressions of the form 


{x | d(z)} (1) 


—in the fashion common in set theory. This shows how a topos be- 
haves like a “universe of sets”. By using such expressions one can, for 
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example, mimic the usual set-theoretic constructions of the integers, ra- 
tionals, reals, and complex numbers and so construct in any topos with 
a natural numbers object the objects of integers, rationals, reals, ... 
(see 88 below). In §6 we will show how the work of Beth and Kripke 
in constructing a semantics for intuitionistic and modal logics can also 
provide a semantics for the Mitchell-Bénabou language of a topos €. In 
practice, this means that one can perform many set-theoretic construc- 
tions in a topos and define objects of € as in (1); however, in establishing 
properties of these objects within the language of the topos, one should 
use only constructive and explicit arguments. 

Let us now specify the (Mitchell-Bénabou) language of a given topos 
€. The types of this language are the objects of €. We will describe the 
terms (expressions) of the language by recursion, beginning with the 
variables. For each type X there are to be variables z, 2’,... of type X; 
each such variable has as its interpretation the identity arrow 1: X — X. 
More generally, a term o of type X will involve in its construction certain 
(free) variables y, z, w,..., perhaps some of them repeated. We list them 
in order of first occurrence, dropping any repeated variable, as y, z, w. 
If the respective types are Y, Z, W, then the product object Y x Z x W 
in E may be called the source (or domain of definition) of the term o, 
while the interpretation of o is to be an arrow 


o:YxZxwoxX 


of €. (In the event that o contains, say, two different variables y, y’ of 
the same type Y, its source will involve a corresponding binary product 
Y xY.) For simplicity, our notation will not distinguish between a term 
o (which is a linguistic object) and its interpretation (which is an arrow 
in the topos &€). 

Here are the inductive clauses which simultaneously define the terms 
of the language and their interpretation: 


e Each variable x of type X is a term of type X; its interpretation 
is the identity x =1: X — X. 

e Terms o and 7 of types X and Y, interpreted by o: U — X and 
7: V Y, yield a term (0,7) of type X x Y; its interpretation 
is 

(op,7q):W 3 Xx Y, 


where the source W has evident projections p: W — U and 


q: W — V. Here the notation ( ,  ) is used ambiguously, 
both for the new term and for the familiar map into the product 
XxyY. 


e Terms o: U — X and 7: V > X of the same type X yield a 
term o =7 of type Q, interpreted by 


(0 = 7): W ASP), X x X EQ, 
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where W and (op,rq) are as in the previous case, while 6x is 
the usual characteristic map of the diagonal A: X — X x X. 

e An arrow f: X — Y of € anda term o: U > X of type X to 
gether yield a term foo of type Y, with its obvious interpretation 
as an actual composite 

(tor U SX ay 

e Terms 0: V > Y* and o: U > X of types Y* and X yield a 

term 6(c) of type Y interpreted by 


6(0): W ——+ Y* x X —£Y. (2) 


where e is the evaluation and the map from W is (@q,op), much 
as above. 

e Terms 0: U > X andr: V > 9% yield a term o € 7 of type 2, 
interpreted as 


oer: WAP, x x OF —2 42, 


e A variable x of type X anda term oa: X x U > Z yield Azo, a 
term of type Z*, interpreted by the transpose of o, 


Azo: U = Z%. 


Here (but only here) we have used the notation from the d- 
calculus for exponential transposition. (Notice that in the term 
Axo, the variable x no longer occurs “free”; and, accordingly, the 
factor X has disappeared from the source of Az.) 


Terms ¢,~,... of type 2 will also be called formulas of the language. 
To such formulas we can apply the usual logical connectives A, V,=>, 7, 
as well as the quantifiers, to get composite terms, also of type 2. In 
principle, this has already been defined: the meet A: Qx Q — CQ given by 
the internal Heyting algebra structure of 2 [see IV.6(3)] gives for terms 


$:U > Qand p: V > Qanew term Ao (4,4): W 9 2x24Q, 
by the clauses above. As usual, we will denote this term more briefly as 
@Aw. The same procedure applies to the other propositional connectives. 
ae ( op,0a) 

Av: WS 2x 9-—A 2, 


ove: W ( oP, ) 2xQ Vv Q 


o>: Ww (op,va) axa => Q, 


@ 


ao: W —*— 2 —=—.. 0 


2 
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Next we interpret the quantifiers: suppose ¢(z, y) is a formula containing 
a free variable x of type X, and others y,... which together give a source 
XxY € E€asabove. Then ¢(z, y) is interpreted by an arrow X XY > 2 
of €. The familiar logical formalism yields a formula 


vag(z,y) (3) 


which no longer contains the variable x as a free variable, hence should 
be interpreted by an arrow Y — 2. This can be done as follows: consider 
the unique map p: X — 1, the induced map P(p): Pl — PX, and its 


internal adjoints Vv 
SE Piata 


P(p) 


Ap 
oO 


as in 81V.9 Theorem 2 and Proposition 4. Now the formula ¢(x, y) gives 
a term Ar¢(x,y): Y ~ O* = PX, and hencea term VpoArg(z, y): Y > 
Q. We simply regard Vr¢(x, y) as shorthand for V, ° (Az@(z,y)). Exis- 
tential formulas Azr¢(z, y) can be treated in exactly the same way. [As 
usual, we will often make the type of the quantified variable explicit, 
and write Vz € X (2, y), ae € X o(x, y) for Vrd(x, y), Srd(z,y). Here, 
read xz € X as “x has type X”.] 

If ¢(z,y) is a formula with free variables x, y, we write { (x,y) | 
o(x,y) } or {(2,y) © X x Y | (x, y) } for the subobject classified by its 
interpretation; this means that this subobject is the corner of a pullback 
diagram 


o* = PX P1=Q9, 


{ (x,y) | o(z, y)} ——— 1 


| |v (4) 


With this convention, we can write the usual expressions such as {< | 
¢(x) } to denote subobjects of a given object X, just “as if” the object X 
of the topos € had elements z. In other words, { x | (x) } is a notation 
for the “extension” of the formula ¢(x) in the topos €. 

The interpretation of the quantifiers Vz and Jz, as explained above, 
can alternatively be described as follows. Write ™: X x Y — Y for 
the projection, and consider the (external!) adjoints to 7~1: Sub(Y) > 
Sub(X x Y): 


Va: Sub(X x U) > Sub(U), dy: Sub(X x U) > Sub(V). (5) 


Then the subobject {(x,y) € X x Y | ¢(z,y)} € Sub(X x Y) yields 
subobjects Vx({(z,y) € X x Y | d(z,y)}) and ar({(z,y) © X x ¥ | 
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o(x,y) }) of Y, and it follows from the definitions that these are precisely 
the subobjects of Y corresponding to the formulas Vz € X ¢(z,y) and 
dz € X ¢(z,y). In other words, these subobjects are given as in (4) by 
pullbacks 


——____—— () ——_——-__- Y. 
is Vad(z,y) , Ar¢(z,y) Me 


Next, we describe truth (more modestly, we usually say “validity”). 
A formula ¢(2,y) of the language of a topos is said to be universally 
valid in € if the arrow ¢(z, y): X x Y — 2 which interprets this formula 
factors through true: 1 — Q); briefly, the formula is true. In other words, 
¢(z,y) is universally valid in € iff the subobject 


{(2,y) | ¢(t,y)}> Xx Y 


is in fact the largest subobject X x Y itself. If ¢ is a formula without free 
variables, its interpretation is an arrow from the empty product 1 into 2. 
Then ¢ is valid iff 6: 1 — | coincides with the arrow true: 1 — 9. Then 
one also says that ¢ holds in E, or ¢ is true in E, etc. Also, a formula 
¢(x,y) with free variables x, y of types X and Y is universally valid in 
€ iff the universally quantified formula VrVy¢(zx, y), whose source is 1, 
is valid in €. 

As stated at the beginning of this section, this language can con- 
veniently be used to describe various objects of €. For example, the 
“object of epimorphisms” , 


Epi(X,Y) > Y%, 


constructed in §3 for given objects X and Y of a topos E, can be de- 
scribed by the expected formula, involving variables x, y, f of types X, 
Voy: 


Epi(X,Y)={f eY* | vye Yar e X f(x) = y}. (7) 


More explicitly, we claim that the subobject of Y* defined in this section 
via the language of € coincides with the subobject Epi(X, Y) defined in 
purely categorical terms. Similarly, exploiting the definition of validity 
just given, the language of € can be used to express properties of a topos 
€. For example, € is Boolean iff the formula 


Vp(p V =P) (8) 
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holds in €, where p is a formula of type N—the “type of truth-values”. 
And € satisfies the internal axiom of choice (IAC), that is 


(—)®:€ —€ preserves epis (9) 


for all objects E of €, iff the expected formula (for arbitrary objects X 
and Y of &) 


Vf EY*(vyeYire X f(x)=ys> 
age X* Wye Y f(g(y)) =y) (10) 


holds in €. 

In principle, it is possible to prove the equality of the two Epi(X, Y) 
subobjects above and at (7), as well as the equivalence of (8) and 
Booleanness, as well as that of (9) and (10), by unwinding the defini- 
tions. Although straightforward, such unwindings often become lengthy 
and cumbersome. Therefore we propose to postpone the proofs of (7), 
the equivalence between (9) and (10), and the equivalence between (8) 
and Booleanness, and first develop a convenient notation for the seman- 
tics suitable for a topos—the so-called Kripke—Joyal semantics. 

To end this section, we make some general remarks about the prop- 
erties of truth. The customary method of deriving new valid formulas 
from given ones can be carried out just as for “ordinary” mathematical 
proofs, using the variables as if they were actual elements, provided that 
the derivation is explicitly constructive. For a general topos, one cannot 
use indirect proofs (reductio ad absurdum) since the rule of the excluded 
middle (@ V =@) need not be valid, nor can one use the axiom of choice. 
More technically, this means that the derivation is to follow the rules of 
the intuitionistic predicate calculus. The details, including a specifica- 
tion of the rules of this calculus, can be found in [Boileau, Joyal] or in 
[Lambek, Scott]. For our presentation we do not need these rules, but 
only the semantics. 


6. Kripke—Joyal Semantics 


Semantics, in our usage, will turn a formula of some formal language 
into a statement in ordinary (naive) language. Thus, the connective V 
means “or”. Stated more carefully, this concerns two formulas ¢(x) and 
a(z) in a free variable x and says for an element a that 


a €{x| 4(x) V U(x) } means 4(a) or ¥(a) 


—and similarly for other connectives and quantifiers. 
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The corresponding rules for a topos € must be more careful. First, 
the variable x must have some object X of € as its type. Second, the 
“element” a must be a generalized element a: UJ — X with some domain 
U. And third (here intuitionism enters!), the “vy” must be documented 
by giving an object V where ¢(a) holds and W for y(a), such that V+W 
gives all of U [rule (ii) below]. Thus, for a generalized element a: U — X 
of an object X in a topos €, there are “semantical” rules which specify 
when this generalized element a belongs to a “subset” (a subobject of 
X) described as {x | d(x) }. Here, ¢(z) is a formula of the language of 
E in one free variable x of type X. These rules depend on the way the 
formula ¢(x) is built up from connectives and quantifiers. These rules 
are usually formulated in terms of a relation “U forces ¢(a)”, defined in 
(1) below and written 


U It o(a). 


Here, ¢(a) denotes the result of formally replacing all the (free) occur- 
rences of x in the formula ¢ by a. [Instead of U lt ¢(a@) one also writes 
more explicitly U lt 6(2)[a] or U It d(2)[a/z].] 

For any generalized element a: U — X with its image Ima € 
Sub(X) one defines 


Ult g(a) if Ima<{a| (z)}. (1) 


In other words, U It $(a) iff a factors through {2 | @(x)}, as in the 
diagram 


{x | ¢(x)}——— 1 
ra | |v (2) 


U a > X jay 


For formulas such as ¢(z,y) or ¢(x#,y,z) in additional free vari- 
ables, one defines forcing in a similar way; thus, for generalized ele- 
ments a: U > X and 8: U — Y, we will say that U forces (a, 8) 
liie., U forces O(x,y) where x and y are interpreted by ao and (J, in 
notation U It g(a, 8), iff the map (a,8): U — X x Y factors through 
{ (x,y) | 6(2,y) }—> X x Y. The extreme case is that in which ¢ does 
not contain any free variables at all. Then for an object U of €, one 
has U \|t ¢ iff the unique map U — 1 factors through the subobject 
{- |} 1 classified by ¢: 1 — 2 (indeed, as a term of the language, ¢ 
has the empty product 1 as its source, hence is interpreted by an arrow 
g: 1— Q). In particular,this means for such a variable-free ¢ that 


lit @ iff@=true: 1-9. 


304 VI. Topoi and Logic 


As another example, consider two arrows a, 8: U > X. Then the 
formula a = 8: U > X is by definition interpreted as 


u 48), x x x X40, 


where 6x, the Kronecker delta for X, is the characteristic function of 
the diagonal A: X — X x X. Thus U It a = @ means by (2) above that 
(a, @) factors through A; in other words, that 


Ulta=f6 iffa=8:U 5X. (3) 


The following two properties of the forcing relation follow easily from 
the definition: 


Monotonicity: If U lt ¢(q@), then, for any arrow f: U’ > U in E, also 
U' It (ac f). 

Local character: If f: U’ — U is epi and U’ It ¢(ao f), then also 
U Ik d(a). 

Here a: U — X is a generalized element and ¢(z) is a formula with the 

free variable x of type X. Of course, monotonicity and local character 

also hold for formulas with more than one free variable (or with no free 

variables). As to the proofs of monotonicity and local character: the 

first is obvious. For the second, consider the two pullback squares 


Q——» P_—_—+ {2 | g(x) } 


Ul ——» U ———> X. 


f 


By assumption, of factors through {xz | ¢(x)}; since Q is a pullback, 
this implies that the mono n: Q — U’ has a section s: U'’ — Q with 
ns = 1. Thus 7n is also epi, hence iso. It follows that f on is epi, and 
hence so is m. Thus the mono ™ is an iso, which means that a factors 
through {x | ¢(x) }. 

The behavior of the forcing relation lt with respect to logical con- 
nectives and quantifiers is summarized by the following theorem. (For 
convenience, this is formulated for formulas with just one free variable 
x of type X. But see the remark below the proof of Theorem 1.) 


Theorem 1. Ifa: U — X is a generalized element of X while 6(r) 
and 7)(z) are formulas with a free variable x of type X, then 


(i) U Ik f(a) A (a) iff U It O(a) and U Ik (a); 
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(ii) U It d(a) V Y(@) iff there are arrows p: V > U andq: W =U 
such that p+q: V+W-—U is epi, while both V lt ¢(ap) and 
W Ik $(aq); 

(iii) U It o(@) => (a) iff for any arrow p: V — U such that V It 
o(ap), also V It (ap); 

(iv) U lt ~é(a) iff whenever p: V — U is such that V lt (ap), then 
V =0. 


For the quantifiers, consider a formula ¢(z,y) with an additional free 
variable y of type Y. Then 


(v) U |t Syd(a, y) iff there exist an epi p: VU and a generalized 
element 3: V — Y such that V lt ¢(ap, 8); 
(vi) U lt Vyd(a, y) iff for every object V, for every arrow p: V =U 
and every generalized element G: V — Y one has V lt ¢(ap, 8). 
For the universal quantifier, one also has 


(vi!) U It Vyb(a,y) iff U x ¥ It b(am, 7). 


This last clause strengthens (vi), replacing the arbitrary object V by 
the product U x Y and the arbitrary @ by the projection m2: UxY = Y; 
for this reason 72 may be called a “generic” element of type Y. 

Observe that clause (vi) here establishes a relation between forcing 
and “truth”. Indeed, a formula ¢(x, y) was said to be universally valid 
iff the interpretation 


(vr) (Vy)d(z,y): 13 2 


is the arrow true: 1 — {. Hence, by the observation above about the 
meaning of forcing for formulas with no free variables, one has 

(vii) @(x, y) is universally valid iff 1 It (Vx)(Vy) (2, y). 
Note also that the clause (ii) embodies the intuitionistic rule for disjunc- 
tion. 

Proof of Theorem: The first rule (i), that for A, is a straightfor- 
ward one: If the arrows ¢(z) and W(x): X — 2, respectively, classify 
the subobjects {x | ¢(x)} — X and {x | ¥(x) } > X then (by definition 
of A: Qx OQ > O) their meet, 


{x | d(z)} Ate | v(z)}, 
in the lattice Sub(X) of subobjects, is classified by A ° ( ¢(z),¥(xz)) = 
(d(x) A(x)). In other words, there is a pullback 


{x | d(z) A ¥(x) }}—— {2 | d(z)} 


| | 


{zx | v(x) }-——___, x, 
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which by the definition (2) makes the equivalence (i) obvious. 

The other clauses follow in a similar way from the correspondence 
between V,>: 2x 2 — Q and =: Q — 1 and the Heyting algebra 
operations in Sub(X). For example, for (ii), if O(z) and ¢(x) classify 
{x | d(x)}—X and {x | 4(x)}—X, respectively, as before, then 
(2) V p(z) = V(e(z), (2)): X 9x ~. 0 classifies the supremum 
of the two subobjects {z | ¢(x)} and {2 | #(x)}. This supremum is 
constructed (as in §IV.6) by factoring r: {z| ¢(z)}+{2|d¢(z4)}— x 
as an epi s followed by a mono, as in the diagram (5) below. Now 
suppose a: U — X factors through {x | ¢(x) V ~(x)} via the map 6B 
displayed in (5). Let p: V — U and g: W — U be the pullbacks of 
{xz | d(x) } and {z | o(x)} along a: U — X: 


eee 


———— > ¢ ee 


Now pullbacks preserve coproducts (§IV.7), so V + W is the pullback of 
s along a, as in the diagram (5). Hence p+ g: V+ W — U is epi, since 
pullbacks preserve epis (Proposition IV.7.3). 


V+W —- {2| d(x) }+{2| v(x} 


yu —42.—+ {2 | d(x) vo(2)} (5) 


| | 


a we 


Moreover, ap and ag evidently factor through {xz | d(x) } respectively 
{x | (x) } as in (4), so V It d(ap) and W It (ag). This proves one 
direction in (ii). 

Conversely, suppose there are maps p: V — U and q: W > U such 
that V+W — U is epi and V it ¢(ap) and W | ¢(ag). This means that 
there are two commutative squares as in (4) (but no longer necessarily 
pullbacks), which together give a commuting square, with r the right- 
hand composite of (5): 


V+W — + {2x| d(x) }+{2| v¥(@)} 
rl | (6) 
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Since p+ q is epi, it now follows that the image of a is that of the 
composite ry. Hence the image of a, and therefore a itself, factors 
through the image {x | ¢(x) V y(x)} of the right-hand arrow r, as 
displayed in (5). 

Next we prove the rule (iii) for implication. First assume that U It 
g(a) => (a) and take any p: V — U such that ap factors through 
{x | (x) }. Since it also (like a) factors through {x | 6(x) > p(x) }, it 
follows that it factors through {x | W(x) }, by the law (6 > c)Ab<c, 
valid for objects b and c in any Heyting algebra. 

Conversely, let M be the image of a: U — X and pull {z | ¢(z)} 
X back along the factorization of a, as in the diagram 


V ——» Mn {2| o(x)}———> {2 | d(x) } 


This determines an object V and a map p: V > U such that V lk ¢(ap). 
By hypothesis, we then have V It ¢(ap). But by the diagram, ap has 
image M1 { x | ¢(x) }; so by the definition of forcing 


MN {x|d(z)}<{2| Y(z) }- 


By the definition of implication >, this gives M < {x | d(x) > (2) }. 
Hence, again by the definition of forcing, U lt ¢(@) > w(a) since M is 
the image of a. 

Leaving the result for the remaining connective — to the reader, we 
turn to the quantifiers and consider first the clause (v) for the existential 
quantifier. Now jy¢... is shorthand for 4, 0 Ayd... so by the defini- 
tion of 3, (Proposition IV.6.3), the object {x | Syd(z,y) } fits into a 
commutative square 


{ (x,y) | d(a,y) }——>- X x Y 


|» (7) 


{x | Sy¢(x, y) }-—_-. x. 


For the implication from left to right in (v), suppose that a: U — X 
is such that U lt Ayd(a,y); this means that by definition a factors 
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through { x | Sy¢(z, y) } as in the bottom row of the following diagram, 
with Sy¢(x, y) as described by the square (7): 


| | i 


ca UC) Pee 


 — a nS, 


Now take the pullback of the middle vertical epi, as on the left above. 
Since pullbacks of epis are epi, this gives an epi p: V — U. The com- 
posite map to the factor X is ap, while that to Y, via the projection 7, 
is some generalized element 8: V — Y. Together they provide a factor- 
ization of (ap, 8) through { (x, y) | ¢(z, y) } and so by the definition of 
forcing give V It ¢(ap, 3), as required for (v). 

For the converse, suppose we are given an epi p: V-—»U and an 
element 3: V — Y such that V It d(ap, 3); that is, such that (ap, 3) 
factors through { (x,y) | (x,y) }. With (7), this gives a commutative 
square 

(ap,B) 


a ee 


V ——> {(z,y) | o(z,y) } ——> X x Y 


: {2 | Sua(z.9) } ” 
| 
U ——_.——> x. 


Since p is epi, it follows that the image of a is contained in that of 
the composite ap and hence, by the commutativity of this diagram, in 
{x | Sy¢(z,y)}. Thus, @ factors through this subobject, so one has 
U It Ayd(a, y). 

Finally, we consider the case of the universal quantifier. First notice 
that (the two right-hand sides of) (vi) and (vi') are equivalent. Indeed, 
for these right-hand sides, (vi’) is a special case of (vi), where V = Ux Y 
while p and £ are the projections; so (vi) implies (vi’). Conversely, 
suppose that U x Y lt d(am, 72), ie., thatax1:UxY xXx Y 
factors through { (x, y) | ¢(z,y) }. Then for any p: V ~ U and 8: V > 
Y, one has (ap,3) = (ax 1) 0(p,f); hence, (ap,@) factors through 
{ (x,y) | O(z,y) } since a x 1 does. 
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It thus suffices to prove the equivalence (vi'). Now, by definition 
§5(6), {x | Vyd(x,y) } = Vr{ (2, y) | o(z,y) }, where 7: X x Y > X is 
the projection, and 


Va: Sub(X x Y) > Sub(X) 


is right adjoint to pulling back along 7. By this adjointness, one has for 
any subobject A> X that A < {x | Vy@(a,y)} iff Ax Y < {(2,y) | 
¢(“,y)}. Now for any generalized element a: U — X, with image 
Im a, in the sense of 8IV.6, the product a x ly is the pullback of a along 
a: XxY — X and pullback in a topos preserves epis and monos. Hence, 
the image of a x 1 is the product Ima x Y, while by the definition (1) 
of forcing 


U lt Vyd(a,y) iff Ima< {2 | Vy¢(z,y) } 


and, by the adjunction above, iff Ima x Y < {(z,y) | o(z,y)} which 
by (1) again means that U x Y It ¢(@7m, 72). This proves (vi’) and so 
completes the proof of Theorem 1. 

Formulas with more (or fewer) free variables satisfy clauses similar 
to those stated in Theorem 1, as we have already noted. For example, 
if y is the only free variable in the formula ¢(y), then clauses (vi’) and 
(vi) for any object U in € become 


U It (Wy) d(y) iff Ux ¥ It 6(72) 
if for any p: V >U and B: VY, Vt d(8). 


There are also mixed forms: for example, if ¢(x) and ~(z, y) have free 
variables as indicated, while a: U — X and 6: U — Y are generalized 
elements, then 


U Ik (a) Ad(a, 8) iff U lt d(a) and U It Y(a, 8). 


This is a typical example of the general pattern of such “mixed” cases. 

The rules of the Kripke—Joyal semantics, applied in succession, serve 
to translate “forcing” statements about € into ordinary assertions about 
€. Thus, for example, consider for objects X, Y in € a formula ¢(x, y) 
in the language of €, where x and y are variables of type X and Y, 
respectively, while a: V — X and @: V — Y are generalized elements. 
Then the rules of Theorem 1 applied in succession to the various logical 
connectives used in ¢ replace the forcing statement 


V Ir (a, B) 


by a statement about V, a, @, etc., in the ordinary external language 
used to discuss €. Since the formulas of € are built up via the logical 
connectives from “primitive” formulas involving = and membership, the 
rules of Theorem 1 are of course to be supplemented by two rules for 
these cases as follows. 
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Proposition 2. If a(x) and r(x) are terms of type Y in the free 
variable x of type X, while a: U — X is a generalized element of type 
X in €, and o’ and 7’ are the interpretations of o and 7 as arrows, then 


Ulko(a)=7(a) iffo'a=7'a: UY. 


This is proved as for (3) above. 
A corresponding proof, using the membership relation 


My — Y x o*. 
defined as that subobject with characteristic map ey: Y x NY — Q, will 
yield the evident rule: 


Proposition 3. If o(z) of type Y and r(x) of type QY are terms of 
the language of € in a free variable x of type X, while o’ andr’ are the 
corresponding interpretations as arrows and a: U + X is a generalized 
element, then 


U lt o(a) €r(a) iff (o'(a),7'(a)): U + ¥ x OY 
factors through My > Y x OY. 


To illustrate the use of forcing, let us return to the formulas discussed 
at the end of the previous section, beginning with the description of the 
object of epis by the intuitively plausible formula (7) of §5: 


Epi(X,Y) ={f © Y* | vy © Y ar € X f(z) = y}. (10) 


Now the definition of this object Epi(X,Y) as given in §3 means, by 
Lemma 3.2 there, that an arrow a: U — Y* of € factors through the 
subobject Epi(X,Y) = Y* iff 


(m,@):Ux X +UxY is epi in €, (11) 


where @: U x X — Y is the transpose of a. On the other hand, the 
definition (2) of forcing states that this a factors through the right-hand 
side of (10) iff 
U |k Vyara(x) = y. (12) 
Thus, we must show (11) equivalent to (12). Let us unwind (12) by the 
rules of Theorem 1. By rule (vi’) it becomes 
U x Y It ax(am) (2) = me 


and thence, by rule (v) for the existential quantifier, that for some epi 
p:V—>Ux/Y and some 8: V > X 


V Ik (am p)(8) = map. (13) 
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Now, by the interpretation (3) for forcing of an equality, this means that 
the two maps on the right of (13) are equal. Since the left-hand map 
involves terms of the form 0(8) with 0 = amp, it is interpreted as in 
§5(2) by an evaluation e( amp, 3). Thus the equality in (13) means that 
the diagram 


peels i yt eX 


U x Y —___,__ Y 


commutes, and hence, by the formula & = e(a@ x 1) for the transpose of 
a, that 


V (™p,8) Ux X 


_ | (14) 


UxY 


commutes. But the existence of such p and @ making (14) commute 
implies that (7,@): U x X + Ux is epi as in (11). Conversely, if the 
latter map is epi, one may choose V = U x X and p = (m1, @), B = 7. 
For this choice, (14) commutes, so U | Vyara(x) = y as in (12). This 
proves (11) and (12) equivalent. 

As a second example, we prove for a topos € that 


E is Boolean iff Vp ¢ (pV —p) holds in E, (15) 


as asserted in §5(8). Here p is a variable of type 1. To prove (15), we 
unwind the right-hand side. Since p is a free variable (hence, a term) 
of type 2, p is also a formula of type 2 containing one free variable (p 
itself), so p defines a subobject 


{p|p} 2 (16) 


[this is of the usual form {z | d(x) }, but x = p= ¢(x)!]. The interpre- 
tation of p is the identity 2 —> 1, so since {p | p} is by definition the 
pullback 


{p|p}——— 1 


a: 


Os = 25g 
id 
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t 
we see that {p | p}>—> 1 is the subobject 1 = a. Consequently, for 
any arrow a: U — 1) (a generalized element of type 2) corresponding 
to a subobject A— U, we have by the definition (2) of forcing, 


Ulta iff @ factors through 1 kes 2 
iff A= U. 


Now suppose that Vp € 2(pV —p) holds in €. This means that 1 It 
Yp € Q(p Vv ap). But by (vi) of Theorem 1, this is equivalent to: for all 
U and alla: U +9, U lt aV 7a. Thus, by (ii) of Theorem 1, there 
are g: V > U andr: W — U such that V+ W -— U is epi, V lt ag, 
and W |t -(ar). Let @ classify the subobject A—U. Then V It ag 
iff g~1(A) = V iff g(V) < A, where q(V) is the image of q as in the 
factorization 


V———> dV) 


oN 


U. 


On the other hand, by clause (iv) of Theorem 1, W It (ar) iff whenever 
some s: W’ — W has the property that W’ It ars, then W’ & 0; 
that is, whenever rs factors through A, then W’ & 0. This means 
that r~1(A) © 0, so that r(W) < =A where r(W) is the image of r 
[defined just as for g(V)|. But g+r:V+W — U is epi, soU = 
q(V) Vr(W) < AV-AA. This shows that the subobject AU has a 
complement, namely, =A. Since U and @ are arbitrary, we conclude that 
E is Boolean {cf. Proposition 1.1(iii)]. This proves the implication from 
right to left in (15). The other direction is similar but easier, and left to 
the reader as Exercise 9. 

As a final example, consider the formula for a topos € which states 
that every epi in € has a cross section 8, as in 


Vf € Y* [(Vyae f(x) = y) > ds € X* vyfs(y) = yl. (17) 


We will prove that this formula is valid in € iff € satisfies the internal 
axiom of choice (IAC); in other words, the validity of (17) is equivalent 
to the property that for any epimorphism X — Y in € and any object 
E of €, the induced morphism X¥ — Y® is again epi. 

In one direction, suppose that € does satisfy IAC. We need to show 
that (17) is valid in €; or, equivalently, that 1 It (17). By Theorem 1, 
parts (vi) and (iii), the latter means that for any object U of € and any 
a:U + Y* 


U It Vyaea(x) = y implies U Ik 3s € XY Vy € Yas(y) = y. (18) 
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Of course, a(x) here is to be interpreted as in §5(2) by evaluation as 
e((a,2)), while on the right the composite in as(y) is also read as an 
(iterated) evaluation. So take such U and a with the property that 
U |t Vyaaa(r) = y. We have just shown, in (11) and (12), that this 
means that (7™,@): U x X +U xY is epi. Now we wish to construct 
for s in (18) a map from Y, say in (U x X)¥. But by IAC, the induced 
map (U x X)¥ — (U x Y)* is epi, and hence so is its pullback r along 
the transposed 1: U — (U x Y)Y of the identity, as in the diagram 


p—*4(y x xy¥ —™, xY 


‘| [evs (19) 


U4 (Ux Y)Y. 
1 
The transposed diagram (with projections a2 and m2 added) 


Ye  pyy—t>Ux x —™ XxX 


ra| ken a) (20) 
U x Y — jy U x Y. 


is also commutative. Let 9 be the composite t¥ og: P + (Ux X)* > 
XY, as in the top row of (19). Then the commutativity of (20) and the 
naturality of transposition gives 


mg=r, TG=8, hence G=(r,). (21) 
But the definition of the transposed via evaluation gives 
B=e(B x 1) =e(B,02) = B(o2): Px Y > X, (22) 


where the evaluation e( 3,02) is the interpretation of the term G(02) 
much as in (13) above or in §5(2). Hence, composing both legs of (20) 
with the projection U x Y + Y gives 02: Px Y 3 Y as og = GG = 
e(a x 1)¢ and hence, by (21), 


02 = e(a x 1)(r, B) = e(ar x 1)(1,8) = e(ar, 8). 


The evaluation at the end is again the interpretation §5(2) of the term 
(ar)(8(o2)). Thus, for the generalized element o2: P x Y + Y we have 


PxyYtMr (ar) (8(o2)) = 02. 
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Hence, by rule (vi’) with ar: P—» Y* and 8: P > XY, 


P Ir (Vy)(ar)(B(y)) = y- 


Since r: P — U is epi we conclude by the clause (v) for the existential 
quantifier in Theorem 1 that 


U Ik As € X* Vy a(s(y)) = y. 


This shows that (17) is valid in € whenever € satisfies IAC. If the patient 
reader examines this proof again, it will appear that the argument has 
indeed been aimed at this end. 

Conversely, suppose that (17) is valid in €; that is, that (18) above 
holds for all objects U and generalized elements a: U + Y~% in €. 
We shall then prove that € satisfies the internal axiom of choice; i.e., 
that for every epimorphism y: X —» Y and every object U the map 
y¥: XU _, Y" is again epi. So consider the composite 

GY? el Sey, 


where ¥: 1 — Y% is the transpose of the given y. Then the transpose 
of @ is 


@= ym: YU x XY (23) 


and so (m,@): YY x X + YY xY is just 1x-, hence is epi. Therefore, 
by Lemma 3.2, a factors through the object of epis, Epi(X,Y) =~ Y~. 
As we have just shown, in (10) above, this means that 


YU It Wy € Yaa € Xa(x) = y. 
Now apply (18) above with U there replaced by YY. This yields 
YY |k 3s € X* Wy € Ya(s(y)) = y. (24) 


Unwinding the assertion (24) by clause (v) of Theorem 1 gives an epi 
p: V + YY and some generalized element ¢: V > X* such that 


V Ik Wy € Y (ap)(o(y)) = y- (25) 


Now apply clause (vi) of Theorem 1 using the maps 7,: V x U — V and 
p:V xU + Y. This gives 


V x U lt (apm)(om(B)) =P. (26) 


Here, the “evaluation” (o7)(p) is computed as the composite e( 071,)), 
call it 


6 =e(om,p): Vx U AEP V x Y 2 XY xVY 25 x, 
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while the evaluation (apm,)(@) is similarly computed as e( apm, 0), or 

v xu 2223 (vx) x XRAY x x eAL Xx X24 Y, 
But, by (23), e(a x 1) = & = y79, so this whole composite is just y@ and 
(26) with p = e(p x 1) makes the diagram 


V xU— 2-4 Xx 


YY x U—,» Y 


commutative. The transpose of this diagram is p = 6: :V > Y"; since 
p here is epi, so is y’. Thus y epi implies any y¥ epi. This is IAC, as 
desired. 


7. Sheaf Semantics 


The Kripke—Joyal semantics of the previous section takes a more 
explicit form when it is applied to a Grothendieck topos; that is, to a 
topos € of sheaves for a site (C, J)—a category C with a Grothendieck 
topology J. Recall that there are functors 

Cc —». Sets®” == Sh(C, J) = &; (1) 
where a is the associated sheaf functor, 7 is the inclusion, and y is the 
Yoneda embedding. The essence of the corresponding semantics is that 
the Yoneda lemma allows us to use for a sheaf X only those generalized 
elements of X in € which are of the form a: ay(C) — X for some 
object C of C. Since a is left adjoint to the inclusion, such a generalized 
element a of X in € is uniquely determined by the map an: yC > X, 
where 7: yC —» ayC’ is the universal map of the presheaf yC' into a 
sheaf, as in §III.5. Furthermore, by the Yoneda lemma such a morphism 
an: yC — X corresponds to a unique element of the set X(C): 


X(C) = Hom(yC, ix) = Hom¢g(ayC, X). (2) 


In formulating the Kripke—Joyal semantics, we will thus use for a sheaf 
X only the elements a € X(C) for various objects C € C, and view such 
elements as generalized elements ay(C) —» X in €, by the isomorphisms 
of (2). 

Let ¢(z) be a formula in the language of the topos €, with a free 
variable x of type X. As in the previous section, we will define a forcing 
relation “C' lt ¢(a)”, where C is an object of the site C and a € X(C): 


Clk oa) iff (i) a € {x | d(x) }(C), 
iff (ii) a: yC — X factors through {x | @(z)} >> X, 
iff (iii) a: ayC — X factors through {x | d(x) } — X. 
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Here {x | (x) }—> X is the subobject of X in € defined from ¢(z) as 
in §5. In clause (i), we regard {x | ¢(x)} as a subsheaf of X, so that 
for each C' € C, the set {x | d(x) }(C) is a subset of the set X(C). In 
clause (ii), we have identified a € X(C) with an arrow a: yC > X, via 
the Yoneda lemma; in (iii), we still write a for the corresponding map 
ayC — X, as in (2). 

Since { x | (x) } is a subsheaf, it is closed under restrictions; that is, 
if f: D —> C isa morphism of C and a € X(C), thena € {x | ¢(x) }(C) 
implies that a- f € {x | $(z)}(D) C X(D). In the forcing notation, 
this gives the monotonicity property (analogous to the one stated at the 
start of §6): 


Monotonicity: If C lt ¢(@) and f: D—C then D|k d(a- f). 
Similarly, the local character property takes the following form: 


Local character: If { f;: C; —> C’} is a cover in the topology J such 
that C; lt ¢(a- f;) for all i, then C Ir ¢(a). 


Notice that this local character property of the forcing relation is simply 
a reformulation of (1) in §IIL8, applied to the subsheaf {x | d(x) } of 
X. (Alternatively, one may derive local character as stated here from 
the local character property of §6, by using Corollary III.7.7.) 

The Kripke—-Joyal semantics may now be restated in the following 
form. (Again, similar clauses of course hold for formulas with more free 
variables. ) 


Theorem 1. For a Grothendieck topology J on C, let X be a sheaf 
on C, while (x), (x) are formulas in the language of the topos E = 
Sh(C, J) ofsheaves on C and z is a free variable of type X; let a € X(C). 
Then 


(i) Clk f(a) Aw(a@) iff C lk O(a) and C lt (a); 
(ii) C IF O(a) V (ax) iff there is a covering { f;: C; + C} such that 
for each index i, either C; Ik ¢(af;) or C; lt Y(af,); 
(iii) C lk O(a) => (ae) iff for all f: D + C, D |r (af) implies 
Dit Hef); 
(iv) C lk ~(q@) iff for all f: D — C in C, if D |r ¢(af) then the 
empty family is a cover of D. 


Moreover, if ¢(x,y) is a formula with free variables x of type X and y 
of type Y, then for a € X(C), 


(v) C lt Ayd(a, y) iff there are a covering { fi: C; ~ C'} of C and 
elements @; € Y(C;) such that C; |t (afi, G:) for each index i; 

(vi) C lk Vyd(a, y) iff for all f: D — C in C and all B € Y(D), one 
has D |r ¢(af, 8). 
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Remark 2. In general, there is no analogue of 6.1(vi’). But when 
C has products and the type Y of the variable y is representable, say as 
Y = ay(B) where B is an object of C, then 


(vi!) Clk Vyd(a, y) iff C x Blt d(am, 72); 


here m2 € ay(B)(C x B) is the element corresponding to the projection 
C x B -» B. Under the hypotheses, the equivalence of (vi) and (vi’) 
follows exactly as in §6. 


Remark 3. As before, iterated application of these clauses will 
translate forcing statements such as C It ¢(a@, 3,6) into “ordinary” state- 
ments about the topos € = Sh(C, J). This again will use the evident 
clauses translating equality and set membership much as in Proposi- 
tion 2 or 3 of §6. 


Proof of Theorem: To prove the theorem, one can proceed in two 
ways. One way is to rewrite C lt d(a) asa € {x | $(x) }(C), and 
then to use the explicit description of the Heyting algebra structure 
and of the quantifiers, as given in §III.8. Indeed, (i) of the theorem is 
simply a rewrite of §IIT.8(3), (ii) of III.8(5) (for a binary supremum), 
(iii) of III.8(7), and (iv) is III.8(19). As for the quantifiers, clause (v) 
corresponds to ITI.8(13) and (vi) to ITI.8(15) [for the special case in which 
the map ¢: E — F of III.8(13), (15) is the projection X x Y + X]. This 
completes one proof of the theorem. 

The other proof explains the relation of this theorem to Theorem 6.1; 
one can rewrite C' lt ¢(a@) as “a: ay(C) — X factors through {<2 | 
o(x)}", that is, as ay(C’) I+ O(a) in the sense of §6. The clauses of 
the theorem then easily follow from those of Theorem 6.1, using the 
properties of monotonicity and local character as stated in §6, together 
with the following familiar properties of the category of sheaves: First, 
the objects of € which are of the form ay(C’) for some C' € C generate 
E (see §I1I.6); that is, for every sheaf X there is an epimorphic family 
of the form 


{ fi: ay(Ci) + X}. 


In particular, if Y and Z are subsheaves of X, then Y C Z iff for any 
object C' € C any arrow f: ay(C’) — X, which factors through Y also 
factors through Z. Finally, one uses that a family { f;: C; -» C } covers 
in C iff the corresponding family of sheaf maps { ay(C;) — ay(C) } is 
an epimorphic family (Corollary III.7.7). 


Let us consider the special case where € is a presheaf topos Sets©” . 
This category of presheaves is the category of sheaves on C for the trivial 
Grothendieck topology, in which a sieve S on C' covers iff the identity 
map lc lies in 9. In this case, clauses (i), (iii), and (vi) of Theorem 1 
remain unchanged, but the others can be simplified as follows: 
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(ii!) Clk h(a) V (ae) iff Clk O(a) or Clk Y(a); 
(iv’) C lk a¢(a) iff for no f: DC, Dlr d(af); 
(v’) C lk dyd(a, y) iff there exists a 8 € Y(C) such that C'lk ¢(a, 8). 
The “semantics” defined by (i), (ii’), (iii), (iv’), (v’), and (vi) is precisely 
the semantics described by S. A. Kripke in his celebrated Semantic Anal- 
ysis of Intuitionistic Logic [Kripke, 1965]. (For Kripke’s purposes it was 
sufficient to consider the case where C is a poset.) This explains the use 
of his name in the title of §6, while the insight that a similar “semantics” 
can be defined for any topos is due chiefly to A. Joyal. 
In §2, we considered the Cohen topos Sh(P, ——), where P is a poset 
equipped with the double negation topology. In view of the definition 
of a cover in the case of a poset, the clauses (i)-(vi) of Theorem 1 can 
be rewritten in this case as follows [where p, g, and r denote elements 
of the poset P and a € X(p)|: 
(i) pik 6a) A a(a) iff pit (a) and pit W(a), 
(ii) plt p(a) V w(e) iff for any g < p there is an r < q for which either 
rlt d(a-r)orrlt y(a-r), 

(iii) p lk d(a) => a(x) iff for any g < p such that ¢ | g(a - gq), also 
qlt o(a-q), 

(iv) p lt =@(q) iff for no g < p, glk d(a-g), 

(v) plk Syd(a, y) iff for any g < p there are r < g and § € Y(r) such 
that r lk ¢(a-r, 8), 

(vi) pit Vyd(a, y) iff for any q < p and any 6 € Y(q), glk $(a- 4,8). 
These clauses form (a standard variant of ) Cohen’s original forcing defi- 
nition. Thus Cohen’s forcing technique is a method to describe truth in 
the topos Sh(P,-—). Given this correspondence, our proof of the inde- 
pendence of the continuum hypothesis is really a translation of Cohen’s 
original argument—a translation in which the explicit use of a notion of 
forcing is avoided by working directly in the topos. 


8. Real Numbers in a Topos 


A topos of sheaves resembles the topos of sets and so, like Sets, 
will have a ring of “real numbers”. Now an ordinary real is a suitable 
kind of set—say a Dedekind cut in the rationals Q. Hence the Mitchell- 
Bénabou language can be used to lift the definition of a Dedekind cut to 
the sheaf category, so as to describe certain sheaves as “real numbers” 
(i.e, Dedekind cuts) there. For a suitable topos, one can then prove 
a famous theorem of L. E. J. Brouwer concerning these real numbers: 
Every function from these reals to these reals is continuous. This will 
be demonstrated in the next section. 

In the present section, we will primarily investigate Dedekind reals in 
a topos of the form Sh(.X )—sheaves of sets on some topological space X. 
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For such a topos Sh(X) the functor T: Sh(X) — Sets of Chapter II, 
which sends each sheaf E to its set [TE of global sections, has a left 
adjoint A: Sets > Sh(X). For each set $, the corresponding constant 
presheaf is the functor sending each open set U of X to the same set 
S; so the corresponding étale space AS is the projection X x S > X. 
Therefore, for each open set U of X, A(S)(U) is the set of continuous 
functions from U to the discrete space S; that is, the set of locally 
constant functions U — S. (See Exercise II.7; see also Exercise II.15.) 
Although a locally constant function need not be constant, the sheaf 
A(S) is called the constant sheaf corresponding to the set S. 

There is a natural numbers object Nx in the topos Sh(X); as ob- 
served in §1 it is simply the constant sheaf corresponding to the ordinary 
set N of natural numbers: 


Nx = A(N). (1) 


The ordinary sets Z of integers and Q of rationals similarly yield the 
constant sheaves Zx = A(Z) and Qx = A(Q). Notice that Zy isa 
sheaf of rings, while Qx is a sheaf of fields (cf. §II.7). Moreover, Zx and 
Qx are linearly ordered by the obvious subsheaves “<” of Zx x Zx and 
Qx X Qx, respectively. 

Starting from the natural numbers object Nx in sheaves, there is 
also a more categorical way of defining the object of integers and that 
of rationals in Sh(X). One proceeds by imitating one of the usual set- 
theoretic definitions of the sets Z and Q in terms of the natural numbers 
N. For example, one may use 


Z={(n,m)|n,meEN}/~%, 


where ~ is the equivalence relation for which (n,m) ~ (n’,m’) iff n+ 
m' = n' +m. Thus the set Z is constructed from N as the coequalizer 


14,726 
Cite!) Nx N——»Z (in Sets), (2) 


(™2a,716) 


where F is the equivalence relation given by the pullback 


E——_*—_+NxN 


Lh ° 
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[Thus, EF is the set of those 4-tuples (n,m’,n’,m) for which n + m’ = 
n’ +m, while the maps a and } send such a tuple to (n,m’) and (n’,m), 
respectively. | 

But this definition makes sense in any elementary topos € with a 
natural numbers object Ne! Thus, we define the object Zp of integers 
in such a topos € as the following coequalizer in €, like that in (2): 


E==— Nex Ne——»Ze (in €), (4) 


where £ is defined by a pullback in € just like (3). 

One can now easily prove that Z¢e is a ring object in the category 
E (Exercise 13). In the case of a topos € = Sh(X) of sheaves on a 
space X, this construction results in the constant sheaf A(Z) on X 
corresponding to the set of integers considered above; that is, there is 
a bijection Zgy(x) = A(Z). This is an immediate consequence of the 
following lemma. 


Lemma 1. For any space X, the functor A: Sets —> Sh(X) pre- 
serves colimits and finite limits. 


Proof: The functor A has a right adjoint I’, so A preserves colimits. 
On the other hand, since limits in the presheaf category Sets©(* °° are 
constructed “pointwise”, it is clear that the functor Sets + Sets?(* 
which sends a set to the corresponding constant presheaf preserves finite 
limits. But A: Sets — Sh(X) is the composition of this functor with the 
associated sheaf functor, and the latter is left exact (see Exercise I1.6). 


yer 
i 


The object Qe of rationals can be defined in a similar fashion in 
any topos € with a natural numbers object Ng. For example, one can 
imitate the set-theoretic definition of the set Q as the quotient { (n,m) | 
né€Z,meN}/~, where (n,m) ~ (n’,m’) if n(m’ +1) = n'(m+ 1). 
[So the pair (n, m) represents the rational n/(m+1).] One thus defines 
the object Qe of rationals in a topos € as the coequalizer 


(m1 u,m2v) Ze Se Ne eee Qe, (5) 


(m1 0,72U) 


where F' and the maps u and v are defined by the pullback 


fF —+*— Ze x Ne 
| [morn (6) 
Ze x Ne Ze. 


m(1xs) 
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Here s is the successor Ne — Ne, m: Ze X Ze — Ze is the multiplication 

of Ze, and Ng is viewed as a subobject of Ze via the monomorphism 
Ne — > Nex 1 222% Ne & Ne — + Ze 

[the last epi being that of (4)]. 

This general construction need not detain us any longer here (but see 
Exercise 13). For the purposes of this section we wish only to observe 
that, by Lemma 1 again, this construction of the object Q¢ of rationals 
yields the constant sheaf Qx = A(Q) in case € is the topos of sheaves 
on a space X. 

More interesting is the construction of the object R¢ of real numbers 
in a topos € with a natural numbers object N¢. Here, we make essential 
use of the Mitchell-Bénabou language, as well as of the power-object 
operation P: E°P — €E. 

If the ordered set R of reals is given, each real number x “cuts” 
Q CR into the two disjoint subsets L and U described as follows: 


L={qeQ\q<z}, V={qEeQ|zK<gq}. (7) 


Without using R, the subsets have the following properties: (a) Each 
is nonempty; (b) L is downward closed but has no largest element; and 
(c) dual properties hold for U, while L and U together compose all of 
Q@ except perhaps for x in case the real number « is rational. Finally, a 
Dedekind cut is a pair of disjoint subsets (LZ, U) of the set of rationals, 
satisfying the following conditions: 


(i) SagEeQ(qeL), AreQ(reT); 

(ii) Va re Q(¢g<rAreLaqébL), 
YVar€Q(ir<qArEeUsgqev); 

(iii) Vae Q(qe La are Q(reLanq<r)) (8) 
YVa@eEQ(qeUs>2reQ(reUar<q)); 

(iv) Vare€Q(q<ra(qeLlvretd)); 

(v) Lnu=6. 


But this definition of the reals makes sense in any topos € with a natural 
number object Ng! Indeed, defining the order relation < on Qe as a 
subobject of Qe x Qe in the obvious way (cf. Exercise13), the Mitchell-— 
Bénabou language allows us to construct an object Re of € as 


Re = {(L,U) € P(Qe) x P(Qe) | (£,U) is a Dedekind cut}; (9) 
here, “(L,U) is a Dedekind cut” stands for the conjunction of the for- 


mulas (i)-(v) in (8) above, rewritten in the language of € (thus, the 
quantifiers 4g € Q, etc., are replaced by Ag € Qe, etc.). 
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One can show that for an arbitrary topos € (with a natural numbers 
object), the object R¢ defined by (9) is a local ring object in €. However, 
rather than going into such general matters, we prefer to calculate a 
special case here, namely that of the topos Sh(X) of sheaves on a space 
X. In this topos, the natural numbers, integers and rationals are simply 
interpreted as above as the corresponding constant sheaves. However, 
the object Rgp(x), abbreviated as Rx, does have the following direct 
description: 


Theorem 2. The object Rx of Dedekind reals in the topos Sh(X) 
on a topological space X is (isomorphic to) the sheaf C’ of continuous 
real-valued functions on the space X defined on the open sets W of X 
by 

Rx(W) =C(W)={f: W ->R]| f is continuous }. 


The proof uses forcing and sheaf semantics. By (9) above, a section 
of Rx over an open set W C X is a section over W of the subsheaf 
Rx C P(Qx) x P(Qx)), that is, a pair (L,U) of elements of P(Qx)(W) 
such that the five clauses (i)-(v) of (8) hold. Now each clause is a 
formula with no free variables, so it holds when it is forced by W. Thus 
condition (8) on (Z, UV) in the forcing language becomes 


(i) W Ik Ag € Qx (¢ € L) A Fr € Qx (r EU); 

(ii) WlkKVa,r €CQx(q<rArEeLsqeéeL) 
A(r<qAreUsqev); 

(iii) Wilk VqeQx (qe Ls sareQx(relbaAg<r)) 
A(qeU > ar €Qx(reUar <q); 

(iv) W IF Ver €Qx(a<r>(qeELvreld)); 

(v) WIFVgeQx Age LAqgel). 


Now Qx is the constant sheaf corresponding to the set Q of rationals. 
Also L, as an element of P(Qx)(W), can be regarded as a map of sheaves 
y(W) — P(Qx) (by the Yoneda lemma), and hence L can be identified 
with a subsheaf of Qx|W, according to the correspondence 


y(W) —4> P(Qx) = (2%) 
y(W) x Qx ——— 0 
a subsheaf of (y(W) x Qx) =Qx|W 


(and similarly for U). Now we can apply the sheaf semantics described 
in Theorem 7.1. For example “g €¢ L” with L a subsheaf of Qx|W, as 
above with Qx = A(Q) a constant sheaf, becomes “g: W —- Q isa 
locally constant function with q € L(W)”. Thus W lk 3g € Qx (Q € L) 
by clause (v) for the existential quantifiers in Theorem 7.1 becomes 
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“there is an open cover {W;} of W such that for each i there is a locally 
constant function g;: W; + Q with g; € L(W;)”. In this way, each of the 
conditions (i)-(v) above can be translated by Theorem 7.1 to become 
the following conditions (i')-(v’) on the subsheaves LE and U. 


(i’) There is an open cover {W;} of W such that for each 7 there are 
locally constant functions q;, r3: Wi; — Q with gq; € L(W;) and 
TEE U(W;). 

For all locally constant functions g, r: W’ — Q defined on some 

open set W’ C W: if g(x) < r(a) for all e € W’ and r € L(W’) 

then q € L(W’); and if r(x) < q(x) for all c € W' while r € 

U(W’) then g € U(W’). 

For all locally constant functions g : W’ — Q on some open 

subset W’ C W, if g € L(W’) [respectively g © U(W’)], then 

there are an open cover {W;} of W’ and locally constant functions 

r;: W/ — Q such that ri(x) > g(x) for all x € W; and r; € L(W;) 

[respectively r;(x) < q(x) for all x € W/ and r; € U(W/)]. 

(iv’) For any two locally constant functions g, r: W’ — Q on an open 
subset W’ C W such that g(x) < r(x) for all 2 € W’ there exists 
an open cover {W/} of W’ such that, for each index i, either 
q|W! € L(WY) or r|Wf € U(W)). 

(v’) For any locally constant function g: W’ — Q ona nonempty open 
subset W’ C W, not both g € L(W’) and g € U(W'). 


Let us write g for the constant function on X with value qg, where q¢ 
is any rational number. Now consider for a point x € W the following 
disjoint sets of rationals: 


(ii 


—" 


(iii 


~— 


Lz ={q€Q|JopenV CW: re V and qV € L(V) }, 
Uz, ={r€Q|iopenV CW: x2e€V andr|V € U(V)}. 


It readily follows from (i’)-(v’) above that Z, and U, form a Dedekind 
cut in the category of sets; i.e., that L, and U, satisfy the five conditions 
in (8) above. Since there is a unique real number sup L, = inf U, which 
corresponds to this cut (Z,, Uz), we can define a function 


fru: WoR 
by fr,u(x) = sup Lz; in other words, for rationals g, r € Q, 
q<fru(e)<r iffge L, andr €U,. (10) 
It follows from (10) that the function fry: W — R is continuous. 


Indeed, if (g, 7) is a rational interval and x € fru, r), then by (10) and 
the definition of L, and U,, there are neighborhoods V and V’ of x such 
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that g|V € L(V) and7|V’ € U(V’). But then for any point ye VNV’, 
we have again by (10) that y € fru r) ThusVnv'c fiu@ r). 
This shows that fy u(q,7) is an open subset of W. Since the rational 
intervals form a basis for the topology of R, it follows that fru: W —- R 
is continuous. 

Conversely, suppose we start with a given continuous function 
f: W — R. We may then define subsheaves Ly and Uy of Qx|W 
by setting for W’ C W and p € Qx(W’) (that is, p is a locally constant 
function from W’ into the rationals): 
peLl;(W') if Veew’, pix2)< pa (11) 
peUu;(W') iff Ve eW', p(x) > f(x) f° 


This pair of subsheaves (L,Uy) satisfies (i’)-(v’) above: Indeed, (i’) 
holds since we can cover W by the open sets W, = {2 € W | —-n < 
f(x) <n}, wheren € N. Then —7|W,, € Ly(W,,) and n|W,, € Us(W,). 
That (ii’) holds is clear from the definition of Ly and Us. To prove (iii’), 
take any open W’ C W and any gq € Ly(W’). Then if z isa point of W’, 
we have g(x) < f(x). So by continuity of f and density of the rationals 
in the reals, there is a neighborhood V, of z and a rational r, such that 
a(y) < rx < f(y) for all y € Vz. Thus, 7,|V2 € Ly(Vz). Since the sets 
V, cover W’, this shows one half of (iii). The other half (concerning 
Uy rather than Ly) is proved similarly. For (iv’), if g, r: W' + Q 
are locally constant functions with g(x) < r(x) for all « ¢ W’, then 
Wi = {2| f(x) < r(x)} and W2 = {x | q(x) < f(x)} cover W’, while 
q|W, € Ls(W)) and r|W2 € Us(W2). So (iv’) holds. Finally (v’) holds 
trivially. By the equivalence between (i’)-(v’) and (i)-(v) above, it now 
follows that (Ly, Uz) is an element of Rx (W). 

It is, finally, a straightforward matter of spelling out the definitions 
to conclude that the operations thus defined, 


(L,U) > fru 
(Ls, Us) Cf, 


are mutually inverse. Since these operations are, moreover, natural in 
the open set W C_X, one obtains an isomorphism of sheaves Rx & C, 
and the theorem is proved. 

For the identification of the reals in some topoi other than those of 
the form Sh(X), see Theorem 9.2 below, and Exercise 14. 


Rx(W) —— Cw 


9. Brouwer’s Theorem: All Functions are Continuous 


Around 1924 L. E. J. Brouwer “proved” that all functions defined on 
(a closed interval of ) the real numbers and with real numbers as values 
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are continuous! Of course, classically this theorem is false. But Brouwer 
worked with an intuitionistic view of all mathematics. He considered real 
numbers as constructed by the method of Cauchy sequences of rational 
numbers. These sequences were “free choice sequences”—so that at any 
given moment the working mathematician has only incomplete informa- 
tion on the sequence. According to Brouwer, these sequences satisfied 
certain “self-evident” principles—comparable in status (for Brouwer) to 
the principle of induction for natural numbers. From these principles, 
he was able to show that whenever a function from say the interval (0, 1] 
to the reals R_ is well-defined, it has to be continuous. 

Now topoi are “generalized” universes of sets, and the logic of such 
universes is in general intuitionistic (the logic is classical precisely when 
the topos is Boolean). Therefore, it is natural to ask whether there are 
perhaps topoi which resemble Brouwer’s world to such an extent that 
all functions from reals to reals are continuous. This question can be 
made more precise, using the Mitchell-Bénabou language: given a topos 
E with natural numbers object Ng, we have seen in the previous section 
how to construct the object Re of reals in E€. We say that “all functions 
from Re to itself are continuous” in € if the following formula of the 
Mitchell-Bénabou language is valid in € (the formula is the usual «-6 
description of continuity): 


Vf € Ree Ve € Re (€ > O= Va € Re 6 € Re (6 > OA } 
Vye Re (wx-—b6<y<a+6> f(x)—e< f(y) < f(z) +€))) 
(1) 

Here Re®é is the exponential in the topos €. Thus, our question is: 
are there topoi such that (1) is valid in €? In this section we will 
provide a positive answer to this question: there do indeed exist many 
topoi in which (1) is valid. For example, one such topos is the so- 
called “gros topos” T = Sh(T) of sheaves on the site T of topological 
spaces, equipped with the open cover topology (see §III.2). In a nutshell, 
the reason is the following. For this topos T, one can prove a result 
analogous to Theorem 8.2: if Rr denotes the object of Dedekind reals 
in the topos J and X is an element of the site T, then R7(X) is precisely 
the set of all continuous functions from X to the ordinary real line R 
(cf. Theorem 9.2 below). But R is also an object of the site T, so now the 
object Rz of Dedekind reals has become representable in T! Continuity 
of all functions then follows, essentially by the Yoneda lemma. 

Let us turn to the details. From now on T denotes a fixed small 
full subcategory of the category of topological spaces, with the following 
properties: 


(i) T is closed under finite limits; 
(ii) if X € T and U is an open subspace of X, then U ¢€ T; 
(iii) the real line R is an object of T. 
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The Grothendieck topology on T is given by the open covers; in other 
words, a basis for this topology is formed by the families of the form 


{ fi: Ui X ie Th, 


where {U;} is an open cover of X and each f, is the embedding of U; in 
X. We wish to prove that (1) is valid in the topos 


T =Sh(T) 


of sheaves on T. 
First, we consider some special sheaves on T. If A is a fixed topo- 
logical space, the functor 


C(—, A): T°? — Sets, (2) 


which sends an object X € T to the set of continuous functions X —- A, 
is a sheaf on T. All the sheaves on T which we shall consider are of this 
simple form (2). 

As for any Grothendieck topos, there are adjoint functors 


[: T = Sh(T) —— Sets: A. (3) 


Here, I is the global sections functor; since the one-point space 1 is an 
object of T [cf. (i) above], one has for any sheaf E on T that ['(E) = 
E(1). The functor A is also easily described explicitly, much as for a 
topos Sh(.X) of sheaves on a single space X: 


Lemma 1. For any set S, A(S): T°? —+ Sets is (isomorphic to) 
the sheaf C( — , S) of continuous functions into the discrete space S. 


Proof: One might proceed by showing that two applications of the 
plus-construction of §III.5 transform the constant presheaf T°P —. Sets 
with value S into the sheaf C(—,S). Alternatively, it is not difficult to 
prove directly that the functor Sets —+ Sh(T), defined by S +> C(—, 5), 
is left adjoint to I. Indeed, for a set S and a sheaf E on T there is a 
bijective correspondence 


@: C(-,S)— E 
(4) 
wy: S—— T(E) 


as follows. Given a natural transformation ¢: C(—,5) - E, let y be 
the component ¢,: S = C(1,S) > E(1) = I(£). And given y, we 
construct a natural transformation ¢: C(—,S) — E by choosing for 
each space X € T the component 


ox: C(X,S) 4 E(X) 


9. Brouwer’s Theorem: All Functions are Continuous 327 


described as follows. For a € C'(X, S) (—that is, a a continuous function 
X —» S—) the sets U, = a~1(s) (for s € S) form an open cover of X. 
For each s € S, 4(s) € E(1) restricts via E(1) — E(U,) to an element 


w(s)-U, € E(U,). 
Since the sets U, are pairwise disjoint, these elements automatically form 
a matching family, hence can be patched together to a unique element 


@x(a) € E(X). This is summarized by the following display: 


X <———— JU, 


if (in T) 
E(x) ——— E(U,) ———— E(1) (restrictions of E) 
x (a) -—— \(s) - Us + o(s) (definition of ¢). 


It is readily verified that ¢x: C(X,S) > E(X) is natural in X, and 
that this indeed describes a bijective correspondence as required in (4). 

The topos T = Sh(T) has a natural numbers object Nz. As in any 
Grothendieck topos, we have Nz = A(N) (as in §1). So by the preceding 
lemma, the sheaf Nz can be described as 


Nr =C(—,N). (5) 


Now Lemma 1 of §8 is obviously still true when we replace Sh(.X) by an 
arbitrary Grothendieck topos of sheaves on some site—the same proof 
still works. It follows as in §8 that the objects Zz and Q7 of integers and 
rationals in our topos T are similarly given by the sheaves of continuous 
functions into the discrete spaces Z and Q. In particular, 


Qr(X) ={f: X ~Q| f is continuous 


(for the discrete topology on Q) } (6) 
={f:X —>Q]| f is locally constant }. 


We now prove the analogue of Theorem 8.2 for the topos T. 


Theorem 2. For the topos T = Sh(T), the object of Dedekind reals 
Ry is isomorphic to the sheaf C = C(—,R) of continuous real-valued 
functions, with 


C(X) ={f: X +R| f is continuous} (for X € T). 


Notice that R is an object of T, and therefore that C' is the repre- 
sentable sheaf y(R) = T(— ,R). 
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Proof: The proof of this result is very similar to that of Theo- 
rem 8.2. For an object W of the site T, an element of R7(W) again con- 
sists of a pair (L,U), where L and U are disjoint elements of P(Q7)(W) 
which satisfy the five clauses (i)-(v). As in the proof of Theorem 8.2, 
the Yoneda lemma shows that L € P(Q7)(W) corresponds to a map 
y(W) = T(—,W) — P(Qr); or if we write P(Q7) = 997 and take the 
exponential transpose, L corresponds to a subsheaf of T(—,W) x Qr. 
Exactly the same applies to U. We will, therefore, identify Z and U with 
subsheaves of T(—,W) x Qz. Thus, L consists of pairs (a, p) where 
a: Y — W is a map from some space Y in T and q: Y > Q is locally 
constant [cf. (6)]. To say that this set L is a subsheaf means, first that 
(a, p) € L implies (af, p3) € L for any map B: Z > Y in T, and second 
that if (a|Vi, p|Vi) € L for each V; in some open cover {V;} of Y, then 
(a,p) € L. The same applies to U. 

Now to say that (Z, UV) € (P(Q7) x P(Qz))(W) lies in the subobject 
Rr — P(Qr) x P(Qr) of Dedekind cuts means that the conditions (i)— 
(v) from the proof of 8.2 hold (where Qx is replaced by Q7 and where 
“W | ...” now refers to the sheaf semantics for the site T). Since Qz 
is the constant sheaf A(Q) on T and the covers of W in T are given by 
ordinary covers by open subsets W; C W, clauses (i)-(v) are equivalent 
to the conditions (i’)-(v’) as stated there. [Except that in (ii’), (iii’), 
and (v’), the phrase “all open W’ C W” has to be replaced by “all 
maps W’ > W in T for any object W’ of T”.| The equivalence between 
(i)-(v) and the slightly modified (i')-(v’) is again a matter of spelling 
out the sheaf semantics. Thus, as before, we can associate with the cut 
(L,U) a continuous function 


fi: W-R 
defined by setting for c € W and gq, r € Q: 


q<fru(x)<r_ iff for some open V C W with eV, 
(V—>W,q) € Land (V—-W,r) €U. 


(As before, g and 7 stand for the constant functions V > Q with values 
q and r.) 

Conversely, given a continuous function f: W —> R, one defines 
subsheaves Ly and Uy of T(—,W) x Qz by setting for a continuous 
map a: Y — W in T and a locally constant function p: Y — Q, 


(a,p) Ly iff p(y) < fa(y) for ally EY, 
(a,p) Uy iff p(y) > fa(y) for all ye Y. 
( 


If we now identify these subsheaves of T(— , W) x Q7 with elements of 
P(Qr)(W) as before, the pair (Ly, Uy) defines an element of R7(W). 
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Furthermore, as in Theorem 8.2, describing the Dedekind reals, the 
bijective correspondence 


fr (L5,Us): C(W) — Rr(W), 
(L,U) +> fru: Rr(W) + C(W) 


is natural in W and gives an isomorphism C' © Ry as asserted in the 
theorem. 


We can now prove that “Brouwer’s Theorem” holds in the topos T 
of sheaves on T. 


Theorem 3. In the topos T = Sh(T), all functions from Rr to 
Ry are continuous, in the precise sense that the sentence (1) of the 
Mitchell-Bénabou language (with T for E) is valid in T. 


Before embarking on the proof of this theorem, let us consider some 
of the expressions occurring in the Mitchell-Bénabou sentence (1), de- 
scribing continuity, for the special case where € = T. By Theorem 2, 
Ry is the sheaf of continuous real-valued functions. For an element 
g: Y + R of Rr(Y), where Y € T, one has 


Y lk g>0 iff g(y) >0 forall yeY (7) 


by the expected definition of the order on Rr. The object R77 is the 
exponential in the topos. Thus, by the Yoneda lemma, for any object 
Y of T, an element F of Ry®7(Y) is given by a natural transformation 
T(-,Y) — Rr®*, or equivalently, via exponential transposition, by a 
natural transformation (still denoted by F) 


F: T(-,Y)x Ry > Rr. (8) 


By Theorem 2, Ry is the representable sheaf T(— , R); also by assump- 
tion T is closed under products so T(—,Y) x Rr = T(—,Y x R). By 
the Yoneda lemma again it has to follow that an F as in (8) corresponds 
to a uniquely determined arrow Y x R — R in T, i.e., to a continuous 
map f: Y x R > R. The natural transformation F and the map f 
determine each other for each Z in T by 


f = Fyxr(m,72), (9) 
Fz(a,9)=fo(a,g) fora: Z+Yandg:Z—>RinT. (10) 


Proof of Theorem 3: As before, let 1 be the terminal object in 
the site T. Then the sentence (1) describing continuity is valid in T iff 
(1) is forced at this terminal object 1, (cf. §6). And by clauses (vi) and 
(iii) for sheaf semantics (Theorem 7.1), this is the case iff for any object 


330 VI. Topoi and Logic 


X of T, any F € Rr®7(X) and any ¢ € Rz(X) such that X It € > 0, 
one has 
X IkV¥a € Rr 36 € Rr (6 >OAVy € Rr(...)), (11) 


where (...) is asin (1) but with F substituted in place of f. So let us fix 
such X, F’, and ¢ with X lke > 0. Then F corresponds to a continuous 
map f: X x R > R as in (9) and (10) above, while e: X — R is an 
everywhere positive continuous function [cf. (7)]. We shall now prove 
(11). 

Recall first from the Remark (vi’) following the statement of Theo- 
rem 7.1 that for any formula ¢(x) whatsoever 


X\k V2 € Rr g(x) iff X x Rlt (72), 


where m2 € Rz(X x R) is the second projection X x R — R. So to 
prove (11), it suffices to show (12): 


se ae en ree ara a ag (12) 
(F (m2) — em < Fly) < F(m2) + m))). 


To this end, consider the given f: X x R -> R and e: X — R. Since 
e: X — Ris continuous and e(x) > 0 for all points z € X, the continuity 
of f implies for any x € X and any t € R that there is a neighborhood 
V, € X of x anda real number 6 > 0 (depending on z and €) such that 
for any z € V, and any s in the interval (¢ — 6,t + 6) one has 


flz8) € (Flat) — Sele) flat) + 5¢(2). (13) 
We claim that it follows from (13) that 


1 1 1 1 
Ve x Gerke 3°) Ir (Vy € Ry (m2 — 56 <y < m2 + 58) = 
F (mq) — em, < Fly) < F(m2)+6¢m). (14) 


Since the open sets V, x (f— $6,¢+ 46) form a cover of X x R, it would 
follow from (14) by the “local character” of forcing (as stated in §7) that 
(12) holds. So a proof of (14) would indeed complete the proof of the 
theorem. 

In order to prove (14) from the continuity condition (13), take any 
arrow 3: Y — V, x (t— 36,t+ 36) in T from any space Y, and consider 
an element g € Rr(Y), that is, a continuous function g: Y — R, with 
the property that 


¥ Ik (ra 0B) —38< 9 < (m2 08) +56 (15) 
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This means that for any point Cc Y, 


m6 (6) ~ 56 < o(0) < m26(0) + 56 (16) 


Because of the codomain of 8 we have |72(¢) —t| < $6, so |g(¢)—t| < 6 
for any ¢ € Y. Therefore, by (13) 


Is(™8(0), 9(6)) ~ Flast)] < 5emB(6)- (17) 


On the other hand, the given codomain of @ means that 8(¢) € Vz x 
(t— $6, + $6), so that, again by (13), |f@(¢) — f(x, t)| < $71 G(¢), and 
hence with (17) 


IFAC) — F(™B(¢), 9(6))| < em A(). (18) 


But fof: Y — R is precisely Fy(m 0 8,72 0 8), by (10), while f o 
(78,9) = Fy (mG, 9). So (18) means exactly that 


Y |k [(F 0 718) (m28) — (em B) < (F om 8)(g) 
< (Fo (m4 ))(m28) + (em B)], (19) 


where the composite F o 7 denotes the restriction of F along 
Rr®7 (1,8): Rr®™(X) + Rr®7(Y). Since this holds for all 6 and 
g which satisfy (15), we conclude that (14) holds, as was to be shown. 
This proves Theorem 3. 


10. Topos-Theoretic and Set-Theoretic Foundations 


As we have repeatedly suggested before, topos theory (more pre- 
cisely, the first-order theory of elementary topoi) can serve as a foun- 
dation of mathematics, alternative to the common foundation by fa- 
miliar axiomatizations of the membership relation, as in the axioms of 
Zermelo—Freenkel set theory. A general elementary topos is much like 
a set-theoretic universe, but “its internal logic” is intuitionistic. To get 
a universe more suitable for classical mathematics, one would at least 
need a Boolean topos. Moreover, in the category of sets, as formulated 
in the Zermelo—Frzenkel axioms (possibly including the axiom of choice), 
there is no distinction between “global and local existence”: these sets 
form a well-pointed topos (see §1). 

In this section, we will briefly investigate the foundation of mathe- 
matics as based on the axioms for a well-pointed (and hence Boolean) 
topos. From this point of view, the notion of a “function” —an arrow in 
the well-pointed topos—is the basic concept, rather than the notion of 
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set-membership, and the Mitchell-Bénabou language allows for conve- 
nient manipulation of replicas of membership. 

The resulting topos theory does not have the full strength of the 
conventional Zermelo—Frzenkel set theory. Instead, it is to be compared 
to a weaker version of set theory in which the comprehension axiom is 
used only for those formulas in which all quantifiers are “restricted” — 
that is, for formulas ¢(r) in the standard set-theoretical language in 
which every quantifier has the form “Vz € b” or “Sx € c”, for suitable 
sets b or c. Specifically, we will show that the axioms for a well-pointed 
topos with a natural numbers object and with the axiom of choice are 
equiconsistent with RZC: restricted Zermelo set theory with the axiom 
of choice, sometimes called “bounded Zermelo” . 

The axioms for RZC are formulated with only the usual primitive 
relations € for membership and = for equality. They are: 


Extensionality: x = y iff, for all t, t € 2 iff t € y. 

Null Set: There exists a set @ with z ¢ @ for all z. 

Pair: For all x and y there exists z with t € 2 ifff=cort=y. 

Union: For all z there exists a set u with t € u iff there is some y with 
teyea. 

Power-set: For all x there exists vy with t € v iff t C x. 

Foundation: For all z 4 @ there exists a y € x with yNz =9. 

Restricted Comprehension: For every set ) and every formula ¢(z) 
in which all quantifiers are restricted, there exists a set u with 
x €uiff (x) andz € b. 

Axiom of Infinity: There exists a set N such that # ¢ N and ze N 
implies x U{x} € N. 

Axiom of Choice: Any of the usual formulations; for example, if z is 
a set with y 4 @ for all y € x, then there exists a function f on x 
with f(y) € y for all y € a. 


Our equiconsistency proof will proceed by showing that from each 
model S of RZC one can construct a well-pointed topos € (more pre- 
cisely, a model of the first-order theory WPT of well-pointed topoi) with 
choice and a natural numbers object—and conversely, for each such well- 
pointed topos € a model of RZC. We shall informally describe these con- 
structions by means of the English language. However, without going 
into the details, we mention that this informal argument can be given 
within the language of elementary arithmetic. This means that each 
construction can be formalized within arithmetic in a purely syntactic 
way; that is, as a translation of the language of RZC into that of WPT, 
or vice versa. 

The first of the constructions is that which builds a well-pointed 
topos € from a model S of set theory; this is just the familiar construction 
of the category of all sets of S; it uses only the axioms of RZC. The rest 
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of this section is concerned with the second construction: that of an 
RZC model from a well-pointed topos. 

First we record some properties of points (that is, of arrows 1 — X, 
ie., global elements of X) in a well-pointed topos. 


Proposition 1. In a well-pointed topos E: 


(i) If A and B are subobjects of X then A < B iff every global 
element p: 1 — X of X which factors through A also factors 
through B. 

(ii) If X #0, then the unique map f: X — 1 splits. 

(iii) A mapa: X — Y is epi iff for every p: 1 — Y there isag: 1 > X 
with aq = p. 

(iv) Ifa: X + Y and B<Y while every gq: 1 -> X has ag factoring 
through B, then a itself factors through B. 


Proof: In (i), if A < B fails in Sub(X), then AN B is a proper 
subobject of A. Thus, the characteristic functions h: X — 0 of A and 
k: X > ND of ANB differ. Since 1 is a generator of E there is therefore a 
point p: 1 + X with hp # kp: 1 > ©. But in a well-pointed topos €, by 
Proposition 1.7, there are only two maps 1 —» 0; namely, the maps true 
and false. If hp = true, then p factors through A and thus by hypothesis 
through B. Therefore p factors through the pullback AM B so that kp 
is also true: 1 —» 0 and kp = hp, a contradiction. Otherwise, hp = false 
and, therefore, kp = true, so p factors through AN B and, therefore, 
through A, another contradiction. The converse of (i) is immediate. 

For (ii), consider the coproduct inclusions m1, 72: X + X +X of X 
in the coproduct. If, = 7, their pullback is trivially X, but coproducts 
are disjoint, so X = 0, against the hypothesis. Otherwise, 7, 4 12, so 
since 1 generates, there is p: 1 + X with mp 4 nop; then fp: 1 > 1 
must be the identity map to the terminal object 1, so p is the desired 
splitting of f. 

In (iii), if a isepi and p: 1 > Y, their pullback Q gives an epi t: Q > 
1. If Q = 0, then ¢ is also mono, so 0 & 1, a contradiction, because E is 
nondegenerate. Hence, t splits by (ii), giving a map h: 1 > @ and thus 
(draw a diagram) a map q: 1 >» X with p= ag. Conversely, suppose a 
is not epi. There are then arrows f, 9g: Y ~ Z with fa = ga but f # g. 
Since 1 generates, there is a p: 1 > Y with fp # gp. But by hypothesis 
there is then a g: 1 — X with ag = p so fag # gag, contradicting 
fa=ga. 

For (iv), let A be the image of a; we need only prove that A < B in 
Sub(Y). To apply condition (i), consider any global element p: 1 > Y 
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which factors through A 


as above. Then by (iii) p also factors through X as p = ag for some 
global element g of X. The hypothesis of (iv) then states that ag = p 
factors through B. Thus, by (i) A < B, as required. 


With the results from Proposition 1, consider now the Kripke—Joyal 
semantics for a well-pointed topos €. Let ¢(y) be a formula of the 
Mitchell-Bénabou language of € with a free variable y of type Y , defining 
a subobject {y | d(y)} of Y, while a: X — Y is a generalized element 
of Y. By the definition of forcing, X It f(a) iff a factors through { y | 
¢(y) }. By (iv) above this holds iff every ag factors through { y | ¢(y) }. 
In other words 


X lt f(a) iff for all g: 1 + X, 1 |r d(aq). (1) 


This states that every forcing condition for truth at an object X of € can 
be completely described by forcing at the terminal object 1 of €. Thus, 
the previous inductive clauses of Theorem 6.1 reduce to the following 
statements about global elements a: 1— Y and @:1—-Y: 


lIk¢(a) Av(a) iff 1 lk d(@) and 1 It (a), 

Lit¢(@) Vv(a) iff 1 It h(a) or 1 IF Y(a), 

1 |Ik¢(a) > w(a) iff when 1 It ¢(@) then also 1 It y(a), 
1 I--¢(a@) iff not 1 Ik d(a), 

1 IFSy¢d(a, y) iff for some 8:1 — Y, 1 It g(a, B), 

1 IrVyd(a, y) iff for every 8: 1 Y, 1 It d(a, 8). 


These are just the familiar semantic rules for the standard interpretation 
of the classical logical connectives! In other words, in a well-pointed 
topos, generalized elements are not needed to describe truth, because 
global elements suffice—and suffice in the familiar way! 

We now construct a model S of RZC from a well-pointed topos with 
natural numbers object and choice. This construction depends on the 
idea of picturing a set x as a tree: z is the root of the tree; on the first 
level are the members of z, joined to x; next above there are the members 
of the members y of z, each joined to its y; and so on, to include thus 
all the elements in the “transitive closure” of x. (Recall that a set t is 
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said to be transitive iff y € x € t implies y € t; the replacement axiom 
of Zermelo—Freenkel set theory—which we do not assume—implies that 
every set is contained in a transitive set.) Here is a picture of a set z as 
a tree. 


Elements of x; ° e e e e e 


Vo 
024 


Within our elementary topos €, we proceed to describe the sort of 
tree here intended, making liberal use of the Mitchell~Bénabou language 
to describe subobjects. 

A tree in E is an object T of € with a binary relation RT x T, 
written in the language as x < y for x, y in T. [As usual, 2 < y stands 
for cs < yA-(x = y), etc.] The axioms for a tree are: 


Elements of z 


(i) Poset: R defines a (reflexive) partial order < on T. 

(ii) Root: There exists 0 € T such that 0 < ¢ for all t € T. 

(iii) Tree Property: For all t € T, the subset |t = {x | 2 < t} is 
linearly ordered by (the restriction of) the relation < of T. (The 
set |¢ is often called the downward closure of t.) 

(iv) Well-founded Down: For all S C T with S # @, there exists a 
y € S such that one never has y > z for z € S (thus, y is minimal 
in S with respect to <). 

(v) Well-founded Up: For all S C T, if S 4 @ then there exists a 
w € S such that one never has z > w for z € S (thus w is maximal 
in S; this requirement is closely related to the usual set-theoretic 
axiom of foundation, as in the formulation of RZC above). 

(vi) Rigid: The only automorphism a: T — T is the identity. 


Here a morphism 8: T — T”’ of trees is a map T — T”’ in € which 
preserves the root 0 as well as the relation < (so an isomorphism T — T’ 
of trees is a morphism with a two-sided inverse, and an automorphism 
of T is an isomorphism of T to itself). 

We have stated the axioms for a tree T in an informal way, as if 
T were a set with actual elements ¢. Fortunately, they agree with the 
formal interpretation in the Mitchell-Bénabou language of € when each 
“element” t € T is read as a global element t: 1 — T. For example, 
Axiom (ii) holds in € if the formula 4sVi(s < t) of the language of € 
holds in €, where s and ¢ are variables of type T; in other words, if 
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1 It AsVi(s < t). As explained above, if € is a well-pointed topos, this 
simply means that there is an arrow, call it 0: 1 — T, such that for any 
other such arrow a: 1 -» T, the pair (0,a): 1 — T x T factors through 
the subobject RT x T defining the order relation on T. [Axiom (i) 
implies that the arrow 0: 1 — T with this property is unique.] Similarly, 
for example, Axiom (iv) should be interpreted as requiring the validity 
in € of the formula 


VS(at(t € S) > Ay(y € SAVz € Say > z))), 


where S$ is a variable of type P(T) and t, y, and z are variables of 
type T. Again, if € is well-pointed we can use the simplified forcing 
definition as explained above, and show that this formula is valid iff for 
every subobject ST (corresponding to a global element 1 — PT), if 
S # 0 (the initial object of €), then there exists a y: 1 — S such that 
for all z: 1 -+ S we have that if (z,y): 1 — T x T factors through R, 
then y = z. Validity of the other axioms for a tree T in a well-pointed 
topos € can similarly be restated in terms of “global elements”—and the 
statement is exactly the usual “naive” meaning of the axioms. 

In discussing trees (in the above sense) in €, we call a global element 
t: 1» T in € a node of T. Each such node determines a subtree (the 
upward closure) 

tt={ar|t<-zx}, 


called a branch of T; such a branch again satisfies the axioms, if T does. 
We also say that a node ¢ covers a node s iff t < s and there is no node u 
of T with t < u < s. The nodes of T covered by the root are called points 
of T; in the model S to be constructed, the points are the elements of the 
set that T is meant to represent, as described below. In the Mitchell— 
Bénabou language, we can define a corresponding subobject P C T of 
the points of T by the formula P = {t €T | Vs € T(s <t 4s =0)}; 
the arrows 1 — P are then precisely the points of T. From Axioms (iii) 
and (iv) above it readily follows that for every node t 4 0 of T there isa 
unique point p: 1 — T with 0 < p < t; one might call p the “ancestor” 
of the node t; for this we write p = At, with A for ancestor. 

We shall occasionally use “floppy” trees, which satisfy all the listed 
axioms except rigidity. However, the rigidity axiom is necessary if a tree 
is to be interpreted as the “set” of its points p. (Indeed, if a floppy tree 
has distinct points p, and p2 such that tp, and fp2 are isomorphic, then 
p, and po would represent the same set, which would be counted twice 
as an element of the set represented by T. However, the isomorphism 
Tp: & Tpe can be extended to a nontrivial automorphism of T, which 
interchanges p; and po; so this situation is excluded for rigid trees T.) 
This extension of fp.) & fp2 to an automorphism is included in the 
following: 
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Lemma 2. A tree F is floppy iff there exists a node t of F covering 
two distinct nodes x and y and such that the upward closure Tr, as a 
tree, is isomorphic to Ty. 


Proof: The following informal sketch can readily be translated into 
a rigorous proof by means of the Mitchell~Bénabou language. When 
there is such a pair of nodes z and y in F, there is clearly an automor- 
phism a: F — F carrying fx to ty by the given isomorphism and Ty 
back to Tx, but leaving all other nodes fixed. Conversely, if a: F — F 
is an automorphism different from the identity, pick a “lowest pair” 
of distinct nodes x and y with a(x) = y (such a pair exists, by well- 
foundedness down.) Then Tz © fy; moreover, x 4 0, so x is covered by 
a node ¢ which must be fixed by a, since x was lowest. 

With these preliminaries, we now construct from € a model of RZC. 
Let S be the collection of isomorphism classes of such rigid trees in €. 
(Alternatively, S is the collection of all such trees, but with the equality 
predicate JT; = T> interpreted as “T, is isomorphic to T)”.) We shall 
prove that S is a model of the theory RZC, where the membership 
relation € is interpreted as 


T, € T, iff there is an isomorphism TJ, & fp for some point p of Th. 


(By rigidity and Lemma 1, the point p is unique.) We now verify the ax- 
ioms of RZC; again, we use informal language, and usually avoid writing 
down expressions in the Mitchell~Bénabou language. (The passage from 
informal description to a proof using this language is straightforward, 
because forcing for a well-pointed topos has, as listed above, just the 
standard semantic interpretation. ) 

Extensionality. This amounts to proving the validity in € of the for- 
mula in the Mitchell-Bénabou language for € which expresses, for trees 
T and T’, that there is an isomorphism from T to T’ iff for any point 
p of T there is a point p' of T’ with Tp & Tp’, and also for any point 
p! of T’ there is a point p of T with tp = Tp’. The “only if” direction 
is clear, because any isomorphism of trees a: T — T"’ sends points p of 
T to points a(p) of T’ with Tp = Ta (p). Conversely, suppose for every 
point p of T that the tree fp is isomorphic to a tree f p’ for some point p’ 
of T’. This p’ is unique, by rigidity, so this gives a morphism f: p> p’ 
in € from the object of points of T to that of points of T’, such that Tp 
is isomorphic to Tp’ by some isomorphism ap: Tp & Tp’. Similarly, in- 
terchanging T and T’, we find a morphism g from points of T’ to points 
of T for which there is for each p’ an isomorphism f,: Tp’ & Tg (p’). 
By rigidity, f and g are mutually inverse, as are ap and (y(p) for each 
p. Then one obtains an isomorphism 6: T — T’ by A(t) = f(t) iftisa 
point of T, while @(¢) = ap(t) if p is the ancestor of the node t of T. 
Null Set. The null set @ is realized by the tree which consists only of 
the root 0. 


338 VI. Topoi and Logic 


Pair. The existence of the (unordered) pair { x, y } of two sets x and y 
is witnessed by the construction on trees which, given two trees JT; and 
T, puts them next to each other and adds a new root; as in the figure 


\/ \/ 


wee 


Union. To form the union of a set represented by a tree T, one simply 
deletes from T all its points and retains the partial order < restricted to 
the remaining nodes. The newly resulting tree will satisfy all the axioms 
except possibly rigidity. By Lemma 1 and rigidity of T, rigidity of the 
new tree can only fail for nodes ¢ and ¢’ in the second layer of T, i-e., 
in the first layer (points) of the tree proposed to represent the union— 
and rigidity only fails if tt & Tt’. So use this to define an equivalence 
relation on the nodes in the second layer (nodes covered by points of T’), 
choose one ¢ in each such equivalence class and take (JT to be the tree 
consisting of all tt for the chosen t, plus the root of the original tree T. 
For example, if T represents a set of the form {{z,y},{z}}, where z 
and y are represented by 7, and T», then (JT is the tree as pictured 
below: 


Ti To Th 
ee Aa A 
e Ne ye e ye e 
e 2. 2 e 
Oe 
The Given Tree T 


qT, Tp qT, T, To 
is Ne RS awe ned 
Oe 


Oe 
The Nonrigid Intermediate Tree The Tree Ur 


10. Topos-Theoretic and Set-Theoretic Foundations 339 


Power-Set. The “power-set,” Pow(T) of a (rigid) tree T may be con- 
structed from T as follows: Pow(T) has a root 0; above it, it has a point 
S for each subset S of the set P of points of T; above such a point S 
of Pow(T), it has a second layer consisting of those points p of T which 
belong to the subset 5S; and after that, the tree grows upwards above 
each such p just as it does in T above that p. For example, draw the 
picture when P has just two elements. 

This intuitive description can easily be made into a construction of 
an object in the topos €: let P c T be the object of points of T, and 
P(P) =? its power object. Then form the object B of € defined by 


B={(t,S)|t#0inT,S CP, and Ate S}; 


here At is the ancestor of T; i.e., the unique point of T below or identical 
to t. Now let Pow(T) be the coproduct in € 


Pow(T) = B+ P(P) +1, 


with “elements” (¢,S) € B, S € P(P), and 0 € 1; define a partial order 
here by taking 0 as the (new) root, and 


(t1, S} (te, S2) iff S, = S» and ty < to in T, 
(t1, $1) iff S, = S, 
S$, < So iff S$, = So, 


0<S all S. 


y< 
g< 


One readily proves that any tree 7, representing a “subset” of T' is 
realized as a point of Pow(T). Indeed, if T, represents a subset of T (so 
that R € T, implies R € T for all trees R), then by definition of the 
membership relation among trees, for any point p, of JT, the tree Tp, 
has an isomorphic copy Tp for a unique point p of T. Let S be the set of 
all such points of T. Then S, viewed as a point of Pow(T), realizes the 
tree T,, in the sense that 7) is isomorphic to the subtree TS of Pow(T). 

Routine arguments show that Pow(T) so defined satisfies all the first 
five axioms for a tree. As for the axiom of rigidity, we apply Lemma 2 to 
the subtrees of Pow(T) of the form T0 = all of Pow(T), TS where S C P 
is a point of Pow(T), and ](t,S) where At is in S. In order to apply 
Lemma 1, suppose there are distinct nodes ; and &2 in Pow(T), both 
covered by a node ¢, such that Té, and T&, are isomorphic. Clearly &, 
cannot be of the form S,, since ($1) = ]( $2) implies that 9, = So, 
by rigidity of T. If €) = (t,,5,) and 9 = (te, S2), then since € and & 
are both covered by the node ¢ above, we have S; = Se, and rigidity of 
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T would imply t; = tz. The other cases (such as £) = S, £2 = (t,S’)) 
are immediately excluded since €, and £2 are assumed to be covered by 
a common node. 

Foundation. Let T be a tree satisfying (i)~-(iv), representing a 
nonempty set z. We wish to show that the set x has the property 
that dy(y € x and yNz = 6). In terms of the tree for x, this means 
that there exists a point p in T such that for no node t of T covered by 
p, can one have ft & Tq for some point g of T. Let S be the following 
set of nodes of T: 


S={t| point g of T: Tt = Tq }. 


Clearly S' contains all the points of T—in particular, S is nonempty 
since we assume that T represents a nonempty set. By (iv), there is a 
maximal node m in S. Let r be the point of T for which there is an 
isomorphism a: tr — fm. If no node t € T which is covered by r 
belongs to S, then r represents a set which is disjoint from x, and we 
are done. Otherwise there exists a node t, covered by r, and a point s of 
T for which there is an isomorphism @: ts —+ Tt, as suggested in the 
following figure: 


cy 


© eat’) 
po 
me See eee 
hee SV 
~o2 eof e@ @ @ 
ee ee AN 
e or es 
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Then the composite a o @ is an isomorphism with Ts & Tt & fa (t). 
Thus, the node a(t) is one of the elements of S, while the existence of 
the isomorphism a shows that t > r whence a(t) > m, so that m is not 
the maximal node in S, a contradiction. 

Comprehension. Suppose ¢(z, y) is a set-theoretical formula with free 
variables x and y, with all quantifiers restricted. We view y as a parame- 
ter, and wish to show that the comprehension principle for g(x, y) holds 
in the context of trees. Thus for any set s we want the set 


r={x€s| (x,y) }- 
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More explicitly we wish to show that for given trees S and T there exists 
a tree R such that the set-theoretical formula Vz € S(6(xz,T) @ x € R) 
holds. This formula, as well as ¢(2, y) itself, is built up from € and = 
using the logical connectives and restricted quantifiers. We can therefore 
spell out the definition of membership € and equality = among trees in 
terms of points and isomorphisms of trees as above, and rewrite ¢(z, T) 
as a formula o(z, T) of the Mitchell~Bénabou language of the topos €, 
where in (zx, T) the variable x ranges over the points of the given tree S. 
(To make this translation into the Mitchell-Bénabou language possible, 
we indeed need to assume that ¢ has only restricted quantifiers since 
these are the only quantifiers available in the Mitchell-Bénabou lan- 
guage: each bound variable has a given type.) For example, if d(z, y) is 
of the form Vu € yav € u(...) and the parameter y is interpreted as (the 
set represented by) the tree T, then ¢(z, T) will be the formal Mitchell— 
Bénabou version of Vp(p is a point of T = Jn(p covers nA(...))). Here 
p and n are variables in ¢ corresponding to the variables u and v in ¢, 
whose type is the object Nodes(T) in € of nodes of T. (It is then possible 
to give a formal definition, by induction on the construction of ¢, of this 
translation ¢ + ¢ from set-theoretical language into the language of €.) 

Now write P for the object of points of S. Then, as shown in §5, 
there is a well-defined object 


R={peP|d(T(p),S)} 


of €. This object R can be viewed as a point of the tree Pow(S) described 
above, while ]( R) [as a subtree of Pow(S)] represents a set r witnessing 
the set-theoretical formula 


Va € S(d(z,y) <> TEr), 


where the parameter sets s and y are interpreted by the given trees S 
and T. 

Axiom of Infinity. We need to construct, in our given well-pointed 
topos €, a tree T representing a set N such that # € N and xz € N 
implies z U {2} € N. In terms of the tree T this means, first that there 
exists a point pp of T with the property that Tp, consists of po only 
(i.e., Tpg represents the empty set), and second that for any point p of 
T there exists a point g of T for which tg & (fp)*. Here ( )t is the 
operation on trees corresponding to the set-theoretic successor operation 
cr 2xU {zx}; that is, for a tree R, the tree R* is obtained by letting a 
new additional copy of R grow out of the root: 


e Re 
Be 


Rt: e 
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The standard (minimal) example of such a set N is “the” set of natural 
numbers, where the numbers are defined as sets themselves, by 0 = 9 
andn+1=nU{n}. In this model, each natural number is itself the 
set of all smaller natural numbers: n = {m € N | m<_n}. The tree 
T representing this set N has a root; above the root it has a point n 
corresponding to each natural number n; above such a node n it has the 
elements of n, i.e., natural numbers less than n, etc. Thus each node of 
T is labeled with a natural number n and the path to this node from 
its ancestral point is labeled with a strictly decreasing finite sequence of 
natural numbers. Thus, we can identify the nodes of T with such strictly 
decreasing sequences, provided we also interpret the root of T as the 
empty sequence (draw a figure!). Such a finite sequence can be modeled 
as a function f: N > N with f(i) > f(é+ 1) for all 7 € N such that 
f (i) #0: Such an f represents the sequence ( f(0)—1,...,f(€—1)—1), 
where £ is the first number with f(£) = 0. So the tree is the set 


T={fENN|WEN(f() 404 A) >fG+))} 2) 
and the order is given by 


fSg iff Vie N (f(t) 40> g(t) = fli). (3) 


In other words, f < g iff the finite sequence ( f(0) —1,..., f(@-—1) — 1) 
modeled by f is an initial segment of the finite sequence modeled by 
g- But clearly by the Mitchell~Bénabou language, (2) and (3) define a 
partially ordered object in any topos with a natural numbers object N, 
and one easily verifies that in our given well-pointed topos €, (2) and 
(3) define a tree representing a set N as required. 
Choice. In the language of set theory, the axiom of choice can be for- 
mulated as Vy(Vz € y(x # 0) > Af: y > Uy(ve € y(f(z) € 2))), 
where “f: y — Uy” is short for the set-theoretic formula stating 
that f is a function from y to the union Jy. Let T be a tree rep- 
resenting a set y with the property Vx € y (c« # ¢). This means 
that the sentence “for every point p of T there is a node ¢ in T cov- 
ered by p”, rewritten in the Mitchell-Bénabou language of €, is valid 
in €. Let S = {t € T | there exists a point covering t}, and let 
A: S + P= points of T be the ancestor map. Then A is epi, so since 
we assume that € satisfies the axiom of choice, the arrow A has a section 
o: P — §. Then is an arrow from points of T to points of the tree UT 
representirig the set LJ y, as described before. It is now straightforward 
to define from o a tree representing a function from y to Uy. (This 
depends on some model for the representation of the cartesian product 
of sets by trees; we omit the details.) 

This completes the verification of the set-theoretic axioms. We thus 
have back and forth constructions for models of RZC and well-pointed 
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topoi € with a natural numbers object and choice: 
SHE, EPS’. 


This gives the asserted equiconsistency of these theories. One can go 
further, and require, for example, that the constructions S + € and 
E+ S’, for a given model S of RZC, yield the same model S’ = S (up 
to isomorphism). This is indeed possible, but only with added axioms. 
For example, to show that every honest set S' in S appears as the set 
obtained from a tree, one needs the additional axiom of set theory stating 
that every set is contained in a (least) transitive set. (This axiom is a 
consequence of the replacement axiom of Zermelo—Frenkel set theory, 
and is much weaker than that axiom. Details are given in [Mitchell]; 
indeed, Mitchell’s formulation of this Mitchell-Bénabou language was 
done precisely in order to make possible this and all the above argument.) 


Exercises 


1. Let N, with arrows 0:1 — N and s: N — N, be a natural 
numbers object in a topos €. 


(i) Prove the following form of recursion “in a parameter”: for 
objects X and Y of €and maps g: X  Y andh: YxX > 
Y, there is a unique f: N xX — Y such that the following 
diagrams commute: 


1X PEW xX Ne X25 5N «KX 
-| | un| | 
xX —_,>—+ Y ¥ x X —,—+Y. 


[In Sets, this would mean that f is the function defined 
by f(0,2) = (2), f(n + 1,2) = h(f(n,2), 2), 

(ii) Use this result to define both addition and multiplication 
as arrows N x N —- N in €; define a subobject R Nx N 
which represents the order <. 


2. Prove that for a small category C, the presheaf topos Sets©”” is 
Boolean iff C is a groupoid (that is, iff every arrow of C is an 
isomorphism). 

3. For a 7, topological space X, show that the topos Sh(X) of 
sheaves on X is Boolean iff X is discrete. 

4, Let A be a distributive lattice with join V and meet A, and let 
A be a filter in A; that is, A C A satisfies le Asa >bEC AS 
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a€¢A,anda,b€ AS>aAbEA. The filter Ais proper when 
AFA. 


(i) Show that if A is maximal (among all proper filters, or- 
dered by inclusion) then for any a, b € A, a Vb € A iff 
either a € Aorbe A. 

(ii) If A is a Boolean algebra, show A is maximal iff for any 
a€A, eithera € Aor -a EA. 


. Observe that a topos € satisfies the (internal) axiom of choice iff 


every object of € is (internally) projective, see Exercises 15 and 
16 of Chapter IV. Rephrase some of the statements proved there 
in terms of the (internal) axiom of choice. Notice that if € is well- 
pointed then 1 is projective in €. Conclude from Exercise IV.16(c) 
that, for well-pointed topoi, [AC and AC are equivalent. Prove 
that if € satisfies IAC, then so does €/E for any object E of E. 
Is the same true for AC? 


. Let € be a nondegenerate elementary topos. 


(i) Show that € is well-pointed iff for every object X of € and 
every A € Sub(X), A= X iff every arrow 1 — X factors 
through A. 

(ii) If € is Boolean, show that € is well-pointed iff for any 
object X of €, X = 0 iff there is no arrow 1 > X. 

(iii) If € is Boolean and satisfies AC, show that for any maximal 
filter A of subobjects of 1 in €, the filter-quotient topos 
E/A is well-pointed and still satisfies AC. 


. Improve on Theorem 2.1 by showing that there exists a well- 


pointed topos € satisfying AC, with a natural numbers ob- 
ject N, in which there is an object X with N~>X>>PN but 
Epi(N, X) = 0 = Epi(X, PN). (Use the preceding exercise.) 


. Give an “elementary” definition of Epi(X, Y) > Y*, without us- 


ing Hom-sets. 


. Let € be an elementary topos, with subobject classifier 2. In §6, 


we proved the implication from right to left of the equivalence 
(15). Prove the implication from left to right. Also, give a more 
direct proof of the implication from right to left, by showing first 
that the subobjects {p | p} and {p| =p} of are, respectively, 
true: 1 >) and false: 1-4. 

Prove that an arrow f: X — Y ina topos € is a monomorphism 
iff the sentence Vx € XVa' € X(fx = fz' > x = 2’) of the 
Mitchell-Bénabou language holds in €. 
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11. 


12. 


13. 


14, 


Prove that the sentence 
Vralyd(x,y) > Af € Y* ved(a, f(a), 


where x and y are variables of type X and Y, holds for any two 
objects X and Y in any topos €. [This formula expresses the 
“axiom of unique choice”; as usual, Sly¢(x, y) is an abbreviation 
of 3y(4(w,y) AVz(6(a, z) + y = 2), 

Let X be an object in a topos €, and let R be a subobject of 
X x X. The “axiom of dependent choice” is the formula 


Vveay r(x,y) => Vaaf € XN (f(0) = 2 AVar(f(n), f(n +1), 


where N is the natural numbers object of €, x, y are variables 
of type X, n is a variable of type N, and r is the characteristic 
function X x X 12 of R. 


(i) Prove that if € is a presheaf topos, then the axiom of 
dependent choice holds in €, for any X and R. 
(ii) Prove that in the topos Sh(NN) of sheaves on the Baire 
space NN, the axiom of dependent choice holds for any X 
and R. 
(iii) Give an example of a topos € for which the axiom of de- 
pendent choice fails, for some X and RC X x X. 


In (4) of §8, we gave a definition of the object Ze of integers in a 
topos €. Define arrows 0, 1: 1 > Zg and +, -: Ze x Ze > Ze, and 
prove the commutativity of the diagrams which express that Ze, 
with these operations is a ring object in €; do this is such a way 
that for the special case where € is the topos Sh(X) of sheaves 
on a space X, this ring structure coincides with the evident one 
on Zsn(x) = A(Z). In a similar way, give a ring structure on 
Qe [see (5) of §8] generalizing the obvious ring structure for the 
case € = Sh(X). Also define subobjects of Z¢ x Ze and Qe x Qe 
which give a linear order on Ze and Qe, respectively. 

(a) Prove that in a presheaf topos € = Sets©””, the object 
Re of Dedekind reals is the constant presheaf A(R). 

(b) Let G be a group acting on a space X, and let € = 
Sh(X,G) be the topos of equivariant sheaves on X (as 
in Exercise 11 of Chapter III). Prove that the sheaf Re of 
Dedekind reals is the sheaf of those continuous real-valued 
functions which are constant along the orbits of G. 
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15. Let A be an open subspace of a topological space E. Let E+E = 


16. 


{ (e,i) |e € E,i € {0,1} } be the disjoint sum, with inclusions 7o, 
m: E> E+E. Let Q be the quotient space of E + E obtained 
by identifying (a,0) and (a@,1) if a € A, and let 7 be the quotient 
map. Show that A> F is the equalizer of mm and 77). Show 
that if + has a continuous cross-section, then A is also closed in 
E. 

The previous exercise was an introduction to this one; the aim is 
to show Diaconescu’s result that if an elementary topos E satisfies 
IAC, then € is Boolean. Below, m: A> E is a mono in €, and 
Mo, mn: E > E+£ are the coproduct inclusions. 


(a) If E satisfies IAC, show that for any epi p: B — C in 
E, there is an epi U -»1 in & such that U*(B)-+U*(C) 
has a section in €/U. (Here U*: € — €/U is the usual 
change-of-base functor.) 

(b) If U -»1 is an epi in € such that U*(A)V—U*(A) = U*(E) 
in Sub(U*(£)) (in the topos E/U), then also AV A= E 
in €. 

(c) Let 7: E+E — Q be the coequalizer of yom and mm. 
Prove that m is the equalizer of 77 and mm. 

(d) Suppose 7 has a section o:Q —- E x E. Prove that m 
is also the equalizer of orn and oxm, and (hence) also 
of tomy and tomm, where t: E+ E + 1+41 is the map 
induced by !: E -» 1. Conclude that A has a element in 
Sub(E). 

(e) Wrap up by concluding that € is Boolean if € satisfies IAC. 


VII 
Geometric Morphisms 


In this chapter, we begin the study of the maps between topoi: the 
so-called geometric morphisms. The definition is modeled on the case of 
topological spaces, where a continuous map X — Y gives rise to an ad- 
joint pair Sh(X) = Sh(Y) of functors between sheaf topoi. The first two 
sections of this chapter are concerned mainly with a number of examples, 
and with the construction of the necessary adjunctions by analogues of 
the ®-Hom adjunction of module theory. In a third section, we consider 
two special types of geometric morphisms: the embeddings and the sur- 
jections. For these two types, there is a factorization theorem, parallel 
to the familiar factorization of a function as a surjection followed by 
an injection. Moreover, we prove that the embeddings ¥ — € of topoi 
correspond to Lawvere—Tierney topologies in the codomain €, while sur- 
jections F — € correspond to left exact. comonads on the domain F. 

Again taking topological spaces as a model, one defines a “point” 
of a topos F to be a geometric morphism Sets — fF. In case F is 
the topos of sheaves on a site, say F = Sh(C, J), such points can be 
described in terms of set-valued functors on the category C and their 
tensor products, as shown in Section 5, where “flat” functors play a spe- 
cial role, comparable to the flat modules (those whose tensor products 
preserve exact sequences). In the next section we generalize this, replac- 
ing Sets by E so as to describe geometric morphisms € —-» Sh(C, J) in 
terms of functors C — €. As a consequence one obtains some explicit 
answers to the question: given sites (C,J) and (D,K), when does a 
functor C — D induce a geometric morphism Sh(C, J) + Sh(D, Kk), or 
perhaps Sh(D, kK) — Sh(C, J)? 

These ideas will also be used in the Appendix of the book, which 
deals with Giraud’s theorem and its remarkable applications. This the- 
orem asks when a category € is (equivalent to) a Grothendieck topos 
over some site; the answer is that € must satisfy certain (infinite) exact- 
ness conditions and must have a set of generating objects. The proof of 
this theorem extends the methods used in the present chapter. 
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1. Geometric Morphisms and Basic Examples 


The definition of a “map” between two topoi is based on the exam- 
ple of sheaves on topological spaces. We recall from Chapter IT that 
a topological space X determines a topos Sh(X); namely, the category 
of sheaves on X. Moreover, a continuous function f: X — Y between 
topological spaces gives rise to a pair of functors: an “inverse image” 
functor f* and a “direct image” functor f., with f* left adjoint to f,, 
as follows: r 

Sh(X) SS Sh(Y),  f" 4 fe. (1) 
Here the direct image functor f, was defined simply by composition 
with f~'; thus, if F: O(X) — Sets is a sheaf on X and U is any 
open subset of Y, then f,(F)(U) = F(f~!U). On the other hand, the 
inverse image functor f* was most readily defined in terms of the étale 


spaces corresponding to sheaves: if p: EF — Y is étale, then f*(E£ a Y) 
is the étale space over X defined by pullback along f, as in the diagram 
of spaces 


P(E) ——E 


rote 


X ———-Y. 


f 


From this description, it follows that f* preserves finite limits; ie., f* 
is left exact. 


Definition 1. A geometric morphism f: F — E between topoi is a 
pair of functors f*: E > F and f,: F — E€ such that f* is left adjoint 
to f, and f* is left exact. Then f, is called the direct image part of f, 
and f* the inverse image part of the geometric morphism. 


Returning to the example of sheaves on topological spaces, we re- 
mark that any geometric morphism Sh(X) — Sh(Y) in the sense of this 
definition necessarily comes from a unique continuous function X + Y 
between the spaces, at least if Y is sufficiently separated. For instance, 
suppose that Y is Hausdorff. Since f*: Sh(Y) — Sh(X) is left exact, it 
must send subobjects of 1 in Sh(Y) to subobjects of 1 in Sh(X). But 
such subobjects are in effect just open sets, so f* restricts to a function 
f*: O(Y) — O(X) which preserves finite limits and (as a left adjoint) ar- 
bitrary colimits; in other words, finite intersections and arbitrary unions 
of open sets. Now define a function f: X —» Y on the points x of X by 
setting 


f(x) =y iff x € f*(V) for all neighborhoods V of y. (3) 
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Given the point 2 of X, there can be at most one point y satisfying 
the right-hand side of the condition (3); indeed, if points y) # y2 both 
do, choose disjoint neighborhoods Vi and V2 of y,; and y2, so that x € 
f*(Vi) N f* (Ve) = f*7 VN V) = ba contradiction. Moreover, given 
x € X there is at least one y € Y satisfying the right-hand side of (3); 
for if not, then any y € Y has a neighborhood V, with x ¢ f*(V,), so 
¢ yey f*(Vy) = f*(U, Vy) = f*(Y) = X—again a contradiction. 


Therefore, (3) defines a function f: X > Y such that f (V) = f*(V) 
for every open set V in Y; it follows that f is continuous. Moreover, the 
geometric morphism induced by f is the same as the original geometric 
morphism f, up to natural isomorphism. Indeed, for any sheaf E on X 
and any open set V CY, 

1 


F.(E)(V) = EF 'V) © Sh(X)(F *(V), E) = Sh(X)(f*V, B), 


and, by the given adjunction, this is isomorphic to Sh(Y)(V, f,£) = 
f.(Z)(V). This proves that, up to natural isomorphism, every geometric 
morphism Sh(X) — Sh(Y) comes from a continuous function f: X — 
Y, at least if Y is Hausdorff. A weaker sufficient condition (Y sober) is 
indicated in Chapter IX. 

Earlier in the book, the reader has already met several other exam- 
ples of such geometric morphisms. For instance, any arrow k: B > A 
in a topos € defines a change-of-base functor of comma categories 
k*: E/A — €/B, by pullback. By IV.7.2, this functor k* has both a 
left adjoint 5°, and a right adjoint ]],. It thus yields a geometric mor- 
phism (again denoted by k) 


ki €/B > €/A (4) 


with k, = |], as direct image part and k* as inverse image part; the 
latter is indeed left exact by the existence of a further left adjoint }°,. 

As another example, recall that if j is a Lawvere-Tierney topology 
on a topos €, the inclusion functor Sh; € — € has a left exact left adjoint 
a, the associated sheaf functor (Theorem V.3.1). This gives a geometric 


morphism 
i: ShjE > €, (5) 


where i, is the inclusion functor and i* = a is the associated sheaf 
functor. Geometric morphisms of this form are called embeddings, and 
will be considered in more detail in a subsequent section. 

A further example comes from a left exact comonad (G,6é,e¢) on a 
topos €. We proved in Theorem V.8.4 that the category Eg of G- 
coalgebras is a topos, and that the forgetful functor U: Eg — E has 
a right adjoint, the cofree coalgebra functor. This functor U preserves 
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finite limits since G does (by construction of limits in Eg, see the proof 
of V.8.4), so this construction yields a geometric morphism 


p: E — Eg, (6) 


for which p* is the forgetful functor which sends a coalgebra (E£, k) to 
E, while p, sends each object E of € into the “cofree” coalgebra p, E = 
(GE, 6g: GE — G*E), where 6 is the comultiplication of the given 
monad. 

The next example of geometric morphisms concerns cross-sections. 
Recall that a (global) cross-section of a space p: E — X over X isa 
continuous map s: X — E such that ps = 1, as in the diagram 


in other words s is a map of the stcey 1 over X into E over X. The 
set ['E of global cross-sections of EF is thus Hom(1,£). Now suppose 
that € is a topos which has all small colimits; for instance, € could be a 
Grothendieck topos. In this case the global sections functor 


I: € > Sets, LE = Home(1, E) (7) 
has a left adjoint 


A:Sets>& AS=][[1, (8) 
8s€S 


defined for each set S as the coproduct of S-many copies of the terminal 
object 1 of €. With this definition, morphisms AS —> E in E clearly 
correspond to functions S — IE of sets, so that this functor A is indeed 
left adjoint to [. To show that this adjunction is a geometric morphism, 
we need to check that the functor A commutes with finite limits. Now 
clearly, Al = 1; ie., A preserves the terminal object. To show that 
A preserves binary products, notice that in any topos € the functor 
E x(—) preserves arbitrary sums, since it has a right adjoint (—)¥”, for 
any object E in €. Thus if { E, },cs and { F; }rer are indexed families 
of objects from €, we have 


(L] 2.) x (LJ) = L]@ « LT») 


sES teT s€S teT 
=] 25 
sESteT 


I] E, x EF. 


(s,tjeSxT 
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In particular, if E, = 1 = F, for all indices s € S and t € T, then 
AS x AT = A(S x T); 


so binary products are indeed preserved by A. To conclude the proof 
that the functor A is left exact, we show that it preserves equalizers. To 


a 
this end, take an equalizer R 4S =3T in Sets; we must show that the 
A 


corresponding diagram 
AR—**5 AS = AT 


is an equalizer in €. So consider a map u: E — AS = [[1. Because, 


8 
in a topos, coproducts are stable under pullback, this map wu gives a 
coproduct decomposition E = |] E,, where the object E, is obtained by 
8 


pulling back along u the s** coproduct inclusion 1 [][1 (a monic by 


8 
Corollary IV.10.5). Then u is just the coproduct of the (unique) maps 
E, — 1. In other words, a map u: E — AS is the same thing as a 
decomposition of E as an S-indexed sum [[E,. Now suppose that u 
equalizes Aa and Af. Thus, (Aa)ou = (AB)ou: E —- AT corresponds 
to a coproduct decomposition E = [], E, of E, indexed by t € T. Then 
for each s € S, we have FE, C Eis) and E, C Egy), so that E, C 
Ea(s) 1 Eg(s) in the lattice of subobjects of E. But by Corollary IV.10.5, 
one has that Eqs) M Egs) = 0 whenever a(s) # G(s); hence E, = 0 
whenever a(s) # @(s), ie., whenever s ¢ R. Therefore EF = ]],-p E,, 
and u: E — AS factors through Ae: AR — AS. This factorization 
must be unique, since Ae is monic (again by Corollary IV.10.5, since 
Ae maps AR into one component of a coproduct decomposition of AS). 
This shows that A preserves equalizers. 
We have now proved that when € has all small colimits the functors 
T and A of (7) and (8) constitute a geometric morphism 


y: E —- Sets, y=A, % =I. (9) 


Notice that for a given topos € there can be, up to natural isomorphism, 
only one such geometric morphism to Sets. Indeed, if f: E — Sets is 
any geometric morphism, then the fact that any set S has S & Sets(1, 9) 
and the assumed adjunction f* 4 fs together give 


fr E & Sets(1, f.£) = E(f"1, E) = E(1, E) = TE, 


for any object E of €. In other words, there is a natural isomorphism 
fx = x) just as claimed. 
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Our definitions give rise to a category with topoi as objects and 
geometric morphisms as arrows, because if g: G + F and f: F — € are 
both geometric morphisms, one can construct the composite fog: G > E 
by (f og)" = "0 f* and (f 09)+ = fx 09s (“adjoints compose” [CWM, 
p. 101]). The identity geometric morphism 1: F — F is obviously a 
two-sided identity for this composition. 

Actually, the set Hom(F, €) of all geometric morphisms f: F - € 
can itself be made into a category: Given two geometric morphisms f, 
g: F — €E, an arrow f — g is defined to be a natural transformation 
f* — g* (between the inverse image parts). One could equally well 
define a map from f to g to be a natural transformation g, — f,, since 
there is a bijective correspondence 


Nat(f*, 9") = Nat(gs, fx), 


where a natural transformation a: f* — g* corresponds to @: gx > fx 
as in the commutative square 


F(f*E, F) ——— €(E, f,F) 
ros.| [ecs2e (10) 
F(g"E, F) —~— E(E, 9. F), 


where the horizontal arrows are the given adjunctions. Indeed (10) de- 
termines @ in terms of @, and vice versa, by the Yoneda lemma. 

This definition gives for any two topoi F and € a category 
Hom(F, €). Moreover, a geometric morphism g: G — F yields a functor 


Hom(g, €): Hom(F, €) + Hom(G, €) (11) 


by composition with g, as follows: Hom(g,&) sends an object f of 
Hom(F,€) to f og, and a morphism a: ff — f3 of Hom(F,€) to 
the morphism g*a@ with components (g*a)e = g*(ag): (fig)*(E) = 
GP fLE > og fZE = (fog)*(£), for any object E of €. Similarly, a geo- 
metric morphism e: € —» D induces a functor 


Hom(F,e): Hom(F, €) + Hom(F, D) (12) 


by “precomposing” with e. Moreover, these various compositions satisfy 
appropriate associativity and identity laws so as to yield a so-called 2- 
category of topoi [CWM, p. 44]: The objects are topoi, the 1-cells are 
geometric morphisms, and the 2-cells are natural transformations a as 
above. (Therefore, if f;, fo: F — € are geometric morphisms, one often 
writes 


a: fi => fa (13) 
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to denote a 2-cell; i.e., a natural transformation ff —> fj.) 
To give an example, recall that the points of a topological space X 
are partially ordered by 


x <a’ iff every neighborhood of x contains 2’. 


(The order is trivial if X is Hausdorff, or 7, but is nontrivial and useful 
in algebraic geometry.) This definition thus induces a partial order on 
the set Cont(X, Y) of continuous functions X —- Y: 


f<g iff f(c) <o(x) foralle eX 
iff f-*(U) C g 1 (U) for all open U in Y. 


So if f < g, then for any sheaf E on X, the restrictions E(g-'U) - 
E(f~1U) for open subsets U C Y together constitute a natural trans- 
formation 9. — f,, or equivalently by (10), a natural transformation 
f* — g*; that is, a map f > g in Hom(Sh(X),Sh(Y)). Thus, if we 
view the poset Cont(X, Y) as a category in the usual way, we obtain a 
functor Cont(X, Y) —> Hom(Sh(X), Sh(Y)). 

For other examples, see the exercises. 


2. Tensor Products 

A next example of a geometric morphism is provided by sets with a 
group action; it depends on the adjunction constructed in Theorem [.5.2. 
That adjunction is analogous to the familiar adjunction between “Hom” 
and “Tensor” for modules. This section will develop some notation re- 
flecting this analogy, to help calculations with the direct and inverse 
functors involved in such geometric morphisms. In particular, this will 
include a “tensor product” of set-valued functors, left adjoint to Hom. 
Recall the basic adjunction in Sets, with product left adjoint to expo- 
nent by the natural isomorphism 


Hom(X x Z,Y) & Hom(X,Y7), (1) 
which sends each function f: X x Z— Y intot: X + Y? with 
f(z, z) = (tx)(z), reExX, z2EZ. 


However, this is not a geometric morphism Sets — Sets, because the 
left adjoint — x Z is not left exact (it does not preserve products!). 

There is an analogous adjunction for modules. For rings R and S 
consider three modules 


Xs, sZR; Yr 
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(i.e., X is a right S-module, Z a left S-right R-module and Y is a right 
R-module). In this situation, there is a corresponding adjunction 


Homr(X ®s5 Z,Y) = Homs(X, Homp(Z, Y)); (2) 


here Homr denotes the set of R-maps, while Homp on the right is the 
S-module of R-maps Z — Y. In this adjunction (2) (called “adjoint 
equivalence” in [Mac Lane, 1963, p. 192]), corresponding morphisms 
f:X@gZ-—-Y andt: X — Homp(Z, Y) are related by the equation 


f(x@®z) = (tx)(z), rEX, zE€@; (3) 


this is essentially the same formula as in the adjunction (1). 

The next section will show how this “tensor-Hom” adjunction applies 
in many cases: to sets with a group action or with a continuous such 
action and to objects in a topos with an action by an internal group 
or an internal category. Each of these cases will involve the definition 
of an action on hom-sets and the construction of a tensor product as 
a left adjoint to such hom-sets. In particular, these constructions yield 
examples of geometric morphisms. 

In this section we will use this tensor product to construct a geo- 
metric morphism between presheaves. For this purpose Theorem [.5.2 
provides a model adjunction. Given a small category C, a cocom- 
plete category € and a functor A: C — €, consider the “Hom-functor” 
R: € + Sets©”” defined by 


R(E)(C) = Home(A(C), £), Cec, EEE. (4) 
It has a left adjoint L which preserves colimits, 
Sets” > £, L=La, R=Ra (5) 


and defined for each presheaf P as the colimit 
La(P) =1i 62 
a(P) =lim( [ P= 6 +6), (6) 


where f P is the category of elements of P, as defined in §1.5. 
We next observe in (11) below that this L.4(P) is much like a tensor 
product of the two functors 


P: C°? — Sets, A: C+ €, 


(i.e., P a right C-module, A a left C-module). First recall ((CWM, 
p. 109]) that all limits can be constructed from products and equalizers; 
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dually, all colimits can be constructed from coproducts and coequalizers, 
and this in a canonical way. For consider the colimit of any functor 
H: J > € to € from some index category, with the injections K;: H(i) > 
LL, #(@), ¢ € J, into the coproduct. Then a morphism ¢: |], H(i) - E 
from this coproduct is uniquely determined by the set of its components 
o: = k;; however, these components ¢; will form a cocone over H to 
the vertex E only when the conditions 


(@;)H(u) = dni: H(i) > E (7) 


hold for all the arrows u: i -» j of the index category J. So take the 
coproduct of all H(dom wu) for all arrows u with its injections ,, and 
construct two arrows 6 and 7 and their coequalizer ¢ to fit in the diagram 


H(dom wu) H(i) 
eS \ or 
| | se (8) 
[] #(domu) = | [2 3-9 E; 
u: inj 


here @ and 7 are defined in terms of the injections ,, and «,; for each 
u: t — j by the conditions 


On =i,  TAy = Kj H(u): H(i) > H(j) ) > L#0. (9) 


Then the equation 66 = ¢7 states exactly that the arrows ox, form a 
cocone (8) over H to the vertex E, while the fact that ¢ is the coequalizer 
of @ and 7 states that this cocone is the colimiting cocone for H. In other 
words, the colimit of the arbitrary functor H can always be presented 
as the following coequalizer: 


I] H(dom u) #0 —* lim H. (10) 


u: taj 


Now apply this presentation to the case when H is the functor Arp 
of (6). Then the second coproduct is taken over all the objects (C, p) 
with p € P(C) of the category { P, while the first coproduct is over 
the maps u: (C’,p’) - (C,p) of that category, so that u: C’ + C and 
pu =p’. Therefore the colimit L4(P) in (6) can be presented as the 
following coequalizer: 


al A(C’) 3 I] A(C Cj 14(P )=P@cA 
C,peP(C) C,peP(C) (11) 
u: CoC 
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in €. Here, as in (9), @ is defined as the map which takes the summand 
A(C") indexed by (p,C’,u) via 1: A(C’) —+ A(C’) into the summand 
indexed by (C’, p' = pu), while 7 sends this same summand A(C’”) via 
A(u): A(C’) + A(C) into A(C), indexed by (C, p). 

The “Hom-functor” Ra of (4) thus has a left adjoint L4 which is 
“like” a tensor product — ®cA. Next we illustrate this analogy by 
examples of this coequalizer. 

First take C = 1 and € = Sets. In this case both functors A and P 
are simply sets, and the coequalizer becomes 


[4S []4— Pea. 


peEP peEP 


Here the coproduct U,4 is just the set of all pairs (p,a) of elements 
p € P,a€A, so P®@A is P x A, while the right adjoint R of (4) 
is just P +> Hom(A, P). Thus in this case the adjunction (5) restates 
the familiar fact that the product — x A is left adjoint in Sets to the 
exponential (_ )4 = Hom(A, —). 

Next, for any category C, take € = Sets. Then the data consists of 


two functors 
P: C°P _+ Sets, A: C — Sets. (12) 


In particular, if C is a group G (a category with one object, all arrows g 
invertible), P is a right G-set and A is a left G-set, while the coequalizer 
(11) is 

Px Gx A= Px A—* P@GA, 


where 4(p, g, a) = (pg, a) and r(p, g, a) = (p, ga) for all elements p € P, 
g € G, and a € A. If we write each ¢(p,a) as p® a, this means that the 
set Pg A consists of elements p® a subject to the equality 


pg @a=pOqga. 


This is just like the tensor product of G-modules Pg and gA, but with 
no additivity condition. 

Generally, for any category C the coproduct |], A(C) of sets in (11) 
is just the product P(C’) x A(C) for C € C. The coequalizer (11) is thus 
the definition of a “tensor product” P@A of the set-valued functors 
(12): 


[| P(C) x Hom(C’, C) x A(C’ = HPO C) x A(C) 25 Pac A, 
c,c’ 

(13) 
where, for elements p € P(C), u: C’ > C and a’ € A(C’), 


O(p, u, a’) = (pu, a’), T(D, Us a’) = (p, ua’). 
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This definition is symmetric in P and A. The elements of the set P@c A 
are then all of the form ¢(p, a). We write such an element as 


o(p,a)=pSa, for pe P(C), a € A(C). 
Then, in virtue of the definitions of 9 and 7 above, we have 
pu®a'=p®eua', peP(C), uC’ +C, a'€ A(C’). (14) 


In other words, the set P @c A is the quotient of the set ]], P(C)x A(C) 
by the equivalence relation generated by these equalities (14)—just as 
in the tensor product X @gs Z in (2) of a right S-module X by a left 
S-module Z. 


Theorem 1 (Hom-®). For functors P and A as in (12) the functor 
Ra: Sets > Sets©”” defined for each set E and each object C of C, as 
in (4), by 

Ra(E\(C) = Homsets(A(C), E) 


has a left adjoint L4 defined for each presheaf P as the equalizer P®g¢ A 
of (13). 


This adjunction 
—-@cA=La: Sets©” = Sets: R4 = Hom(A-.,...) 
is expressed by the isomorphism (where € is to be read as Sets) 
Home (P 8c A, E) = Nate(P(C), Homg(A(C), E)), (15) 


natural in P and EF; here Nate designates transformations natural in 
the object C. This isomorphism (15), and hence the adjunction, is a 
consequence of Theorem I.5.2 and the representation above of the colimit 
there as a coequalizer. It can also be seen as a direct. consequence of the 
coequalizer definition (13) of the tensor product ®c. 

Indeed, specify that arrows 


f: P®cA-E, tc: P(C) > Home(A(C), E) 
correspond under the bijection (15) when they are related by 
fc(p, a) = te(p)(a), Cec, peP(C), ac A(C). (16) 


Here we regard f as a function on [][, P(C) x A(C) with components 
fo: P(C) x A(C) — E which satisfy 


fo: (pu, a) = fo(p, ua) (17) 
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in accord with the equivalence relation (14). It then follows formally 
that such an f does by (16) define a family t¢ which is natural in C and 
conversely that such a tg determines by (16) a family fc satisfying (17). 
This gives a direct proof of the Hom-® adjunction (15) for € = Sets. 

If the category Sets is replaced by any cocomplete category €, with 
A now a functor C — &, the cocompleteness of € provides the same 
definition of @c as a coequalizer and the same adjunction holds again as 
a special case of Theorem I.5.2. The proof is essentially the same, when 
the element a in the set A(C) of (16) is replaced by a generalized element 
a: W —»+ A(C) from some (parameter) object W of €. Then f is regarded 
as a function [|], ,, A(C) — E with components fo: A(C) — E for 
each p € P(C’) which satisfy 


fo’ pu = fop 0 A(u): A(C’) + A(C) — E (17bis) 


for all arrows u: C’ —-» C. The condition (16) defining the bijection then 
reads 
fop°a=tc(p)oa:W > E. (16bis) 


This argument, like others using a “generalized element” a € 
Hom(W, A(C)), is natural in the object W. Hence, as in other uses of the 
Yoneda lemma, it is determined by setting W = A(C) with a the iden- 
tity map. This gives the bijection (16bis) the simpler form fco,p = tc(p). 
Since this correspondence (16) is formally like that for sets, as in (1) 
above, or that for modules, as in (3) , we call (15) the general “Hom-@ 
adjunction”, as follows. 


Theorem 1 [bis]. The adjunction (15) holds for cocomplete €. 


Next, consider the case with two categories C and D, where A is 
a bifunctor A: C x D°P -» Sets (here € = Sets). The tensor product 
P@c A is then not just a set but a contravariant functor on D to sets 
defined on objects D of D by 


(P®c A)(D) = Pc A(—,D), 


while the right action of any arrow h: D’ — D is defined, in agree- 
ment with the identification (14), by (p®a)h = p@(ah). Similarly, any 
Q: D°P — Sets gives a functor Hom,(A,Q): C°? — Sets, where for 
each object C the set Hom,,(A,Q)(C) is the set of all natural trans- 
formations 6: A(C, —) — Q of contravariant functors on D (i.e., mor- 
phisms of presheaves on D). [Observe that Homp(—,—) denotes a 
set while the underlined Homp(— , — ) denotes a functor—much as in 
the more elementary case (2) for bimodules. Recall that in §1 (11) 
Hom(F, €) is also “enriched” as a category.] For these three functors 


P: C°P -++Sets, A: Cx D® — Sets, Q: DP — Sets 
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the Hom-® adjunction now reads 


Homp(P &c A, Q) Y Home (P, Homy (A, Q)). (18) 


Explicitly, morphisms ~: P®c A — Q of presheaves on D correspond 
in this bijection (18) to morphisms 7: P — Homy(A,Q) of presheaves 
on C when they satisfy the familiar identity, rewritten from (16), 
y(pSa)=r7(p)(a), pEeP(C), a€ A(C,D) (19) 
for all objects C and D. Thus, the bifunctor A yields an adjunction 
~ @c A: Sets©” —— SetsP” : Homp(A, — ). 


But this need not be a geometric morphism of topoi, because — @c A 
need not be left exact——just as for an R-module A, the tensor product 
— ®p A is not always left exact. However, for R-modules, the identity 
functor — @z RF is left exact; a corresponding result holds in our case 
if we replace the ring R (a bimodule) by the category D, regarded as a 
bifunctor. 

Indeed, for any category D, the hom-sets, reversed, constitute a 
bifunctor, which we write as °D°, or sometimes as Mohp: 


*D® = Mohp: D x D®°? —> Sets; °D°(D, D’) = Homp(D’, D). 
For any presheaf Q: D°P —+ Sets there are then canonical isomorphisms 
Q ®p °D® ~ Q = Homp(°D’*, Q) (20) 
obtained from the identity 1: D — D just as for the familiar isomor- 
phisms X ®p R = X = Home(R, X) for an R-module X over a ring R. 
It is sometimes convenient to write the first isomorphism of (20) as 
Q@Mohp =Q, Q®Mohp(-,D) = Q(D), (21) 
where 
Mohp(D, D’') = Homp(D’, D), Mohp(D, —) = y(D). 


If ¢: C + D is any functor, each presheaf Q: D°P — Sets on D deter- 
mines by composition a presheaf Qy = Qo ¢°P: CoP —+ D°P -» Sets on 
C. Similarly, a functor B: D > Sets yields ,B = Bog: C — Sets; 
for instance, *D® yields ,D®: C x D°P — Sets. Applying this to the 
canonical isomorphisms (20) above yields isomorphisms 

Q @p °Dg = Qs = Homp(4D*, Q). (22) 

Theorem 2. A functor 6: C — D induces a geometric morphism 
: Sets©” — Sets?” 


for which the inverse image ¢* takes each presheaf Q on D to the com- 
posite *Q = Qo g°P = Qy. Moreover, ¢* has a left adjoint ¢y. 
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Pru According to the canonical isomorphisms (22) the functor 
¢* is bea tensor product (with *D,) and a Hom-functor (from 4D°). 
Hence the basic Hom-® adjunction in these two representations of 
¢*, it wboth right and left adjoints, which are respectively 


bx =2M¢(*Dg, — ), ¢1 = ~ ®c(4gD*): Sets©” — Sets?”. 


Thore, d*, as a right adjoint of ¢;, must be left exact, so that ¢, 
has a srexact left adjoint ¢*, and thus is geometric. 


A psmetric morphism with the special property that its inverse 
image xt has a left adjoint (and hence is necessarily left exact) is 
called usssential geometric morphism. Thus we have shown that any 
functa: C — D induces an essential geometric morphism on the 
correspuing categories of presheaves. Our argument for this result 
depend directly upon the Hom-@ adjunction (15), which was used 
both tsuggest the adjoints ¢, and ¢;, via (22), and then again to 
estableach of the corresponding adjunctions. 

In us theorem one may replace the category Sets (in Sets©” 
etc.) bitopos €; then C and D are to be replaced by category objects 
(intern -ategories) in €. We will carry this replacement out in the next 
sectior dy for group actions, that is, in the special case when C and 
D are“aced by group objects G and H in the topos €. 

Fo: ference, we summarize the several cases of tensor products. 
For € «mplete and C small there is a tensor product 


@c: Setso” x ECE, (RA) R@CA. (23) 
This wiilso appear in (7.12). For € = Sets, this specializes to 
c: Sets©” x Sets© — Sets, (P, A) P@cA, (24) 
as intraced in (11) above. This definition is not symmetric in P and 
A, but:vmmetric form appears in (13). This includes the case when 
C is asip G. More generally, if K is a group object in the cocomplete 
catego: :. (3.1) below will define a tensor product 
ax: (BK)? x BK + E, (X,Z) > (X @x Z). (25) 
Finall; 3) above considers the case in which one factor is a bifunctor: 


Bc ss-” x SetsCxP” —, SetsP””, (P,A) > P@cA. (26) 


Of couxthere can be cases in which both factors are bifunctors, etc. 
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3. Group Actions 


Our objective is to show that a morphism ¢: G -» H of group objects 
induces an essential geometric morphism between the categories of right 
G- and right H-objects in a topos €. 

First, consider in the topos € a group object K, a right K-object X 
with its action-map ay: X x K — X and a left K-object Z with its 
action map Gz: K x Z — Z. Their tensor product over K is defined to 
be the object X ®x Z specified by the following coequalizer in €, 


axxl e 
Xx Kx Z=HAX x Z—* +X Ox Z. (1) 
XPZ 


This is essentially just the set-theoretic definition; for a generalized el- 
ement of X x Z (an arrow from a “parameter”-object U —» X x Z) 
is just an ordered pair (x, z) of generalized elements x: U — X and 
z: U + Z. If we write x @ z for the element e(x, z): U > X @x Z, then 
the definition (1) amounts to the familiar identity 


th@z=x@kz (2) 


for any generalized element k: U — K of K; here we have written rk 
for ax 0 (x#,k), and kz analogously. Formally, the coequalizer definition 
(1) means that an arrow ¢: X ®x Z —» W to any object W can be 
uniquely defined by giving an arrow ¢’: X x Z — W such that for 
any generalized elements x, k, z defined over any U (i.e., 2: U + X, 
k: U + K, z: U > Z), the identity 


$' (x, kz) = $'(ak, z) (3) 


holds, much as in (17) of §2. 

Now suppose in addition that G is another group object in € and 
that Z = xZg is both a left K- and a right G-object in €, with the 
right G-action ax: Z x G — Z commuting with the given left K-action. 
Then, much as for modules, the tensor product X ®x Z has a right G- 
action. To define it, observe that the functor — x G has a right adjoint in 
€ (the exponential), hence preserves coequalizers, so that the top row in 
the following diagram is a coequalizer with bottom row the coequalizer 
definition (1) of ®g: 


ax1xl 


Xx Kx Zx% @aS3 xX x Zx G15 (XO Z)xG 


1xBx1 
1x1xaz |r Va (4) 
X x K x ZS X x Zz > X OZ. 
Since both squares on the left evidently commute, a vertical action map 
a is uniquely defined between the coequalizers, as shown dotted on the 
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right. One can exhibit a in terms of generalized elements as a(x ® z,g) = 
x ® zg; in other words, the action of g is given just as for modules by 


(x@z)g=r@zg (2: U + X,z:U > Z,g:U >G). 


The needed property for the action by a product 9192 then follows; one 
may also observe that the two vertical maps in (4) make X x K x Z and 
X x Z into right G-objects. One may also show that this new tensor 
product is associative: (X @x Z) @gGW = X ®x(Z@gqW) for suitable 
X, Z, and W. 

A Hom-object in € is defined similarly. For right G-objects Z and Y 
with actions az and ay, define the (enriched) Hom-object Homa (Z, Y) 
as the following equalizer 


Hom,(Z,¥) —+Y” ——3 y?*¢ (5) 


in €, where v and w are the transposed maps of the following maps 7 
and w, respectively: 


6: Y% x Zx Gla yV? x Z—M Sy, ' 
(6 


@:Y%xZxGHyYx Gy, 


and “ev” is evaluation. More explicitly, for generalized elements t: U > 
Y7 and z: U + Z we will write t[z] for the evaluation ev(t,z): U — 
Y%xZ-—Y. Then the maps in (6) are given for g: U — G by d(E, z,g) = 
t[z-g] and w(t,z,g) = #[z]-g; in other words, a generalized element 
t: U + Y@ lies in (factors through) Hom¢(Z, Y) exactly when t[zg] = 
t[zlg for all z and g, just as in the definition of a natural transformation 
t of functors of G. [Notice that the quantifier “for all z and g” should 
be read in the style of Kripke—Joyal semantics as: for all 6: V > U and 
all generalized elements z: V  Z, g: V — G, so that t[{zg] stands for 
(t8) [zg] = 0(EB, z,g): V > Y, and similarly for ¢[z]g.] 

If Z is also a left K-object, then a right K-action on Hom,(Z, Y) 
can be defined in the expected way. Indeed, since the product func- 
tor — x K preserves equalizers (“limits commute with limits” [CWM, 
p. 210]), both rows of the following diagram on the definition (5) con- 
stitute equalizers, 


1 
Hom,(Z,Y) x K ——> Y? x KY?" x k 
wxl 


_— 


Vv 


Hom,(Z, ¥) ————> Y¥ == 3 ¥7"E, 


| 
| 
| 
l 
v 
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where the vertical maps a and 0 are right K-actions defined, respectively, 
as the transposes of the maps 


@: Y? x K x g—_**82__dgd p27 x ge, 


B: YAXE) x Kx Zx GARE, ySx® x (Z x GY) Y. 


Both squares on the right in (7) commute. For example, to show that the 
square with v’s commutes, first take the transpose; the left-bottom map 
is then ev 0(ax1)o(1x1xaz) = @(1x1xaz) =ev(1 x Bz)(1x 1x az), 
and this is equal to the top-right composite, using the definitions of band 
v. By this commutativity, there is, therefore, on the left a dotted vertical 
map as indicated; one checks that this makes the object Hom,(Z,Y) a 
right K-object in €. [In fact the maps a and 6 make Y? and Y2*® into 
right K-objects in (7), and v and w are maps of right K-objects.| This 
definition (7) of the action by K on Hom,(Z, Y) can also be stated in 
terms of generalized elements t: U + Hom,(Z,Y) and h: U > K, for 
then t-h is given by (t-h)[z] = t[h- z], just as for functions in the familiar 
case where € = Sets. 

For group objects G and K in the topos € and objects in € with 
actions as indicated by Xx, «Zg, and Yg, we now establish the basic 
Hom-tensor adjunction in the familiar form 


Homg(X ®x Z,Y) = Homx(X,Hom,(Z,Y)), (8) 


natural in all three arguments. [Notice that (8) is an isomorphism of sets: 
Homg and Hom, denote the sets of G-maps and K-maps, respectively, 
while Hom, is the K-object as just defined.] If a G-map f: X @x Z > 
Y corresponds to a K-map t: X — Hom,(Z, Y) under the isomorphism 
(8), then f and ¢ are related by the familiar formula in generalized 
elements x and z 


f(2® 2) = (to2)(z). (9) 
More explicitly, given t: X — Hom,(Z,Y), (9) defines first f: X x Z > 
Y, and then f on X ®x Z, since the necessary property f(rk®z) = 
f(z ® kz) follows because t is a K-homomorphism, while the fact that f 
is a G-map follows by the usual arguments, because each ¢ o x is in the 
subobject Hom,(Z,Y) of Y7. This proof is thus essentially the same as 
that when G and H are category objects in Sets (Theorem 2.1). Note 
also that in the correspondence (8), the map f taken as X x Z + Y 
is indeed adjoint to t taken as X — Y%, so (8) is the restriction to 
the appropriate subsets of the familiar adjunction between product and 
exponential given with the very definition of a topos. 

Now, as for categories in Sets (Theorem 2.2), consider a morphism 
¢: G — H of group objects in €. The right G- and right H-objects in the 
topos € constitute topoi BeG and Be H respectively, by Theorem V.6.1. 
(We consider right actions here, so write BeG for E°°”.) 
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Theorem 1. A morphism ¢: G — H of group objects in a topos 
E induces three functors on the topoi BgeG and BeH of right G- and 
H-objects in E: 


és: BeG > BeH, ¢*: BeH — BeG, og: BeG > Bed, 


with $* left adjoint to ¢, and ¢, left adjoint to ¢*. Therefore ¢ induces 
an essential geometric morphism 


¢:BeG Be (10) 


with direct image $, and inverse image ¢”. 


Proof: Each right H-object Y determines by ¢ a G-object ¢*Y; 


1x@ 
namely, the same object Y with the right G-action Y x G cali 


Y x H —— Y; we also write ¢*Y = Yy. In particular, the left H- 
right H-object H becomes a left H- right G-object H,. Then, just as 
in (22) of §2, there are canonical isomorphisms of G-objects 


Hom; (gH,Y) = ¢*Y = Y @u Hg. (11) 


Thus ¢* is both a Hom-functor Hom;;(4H, —) and a tensor product 
functor, so by the basic Hom-tensor adjunction (8) has both right and 
left adjoints 


ob. =Hom;(H3,-), $= — @x(gH). (12) 


This completes the proof. 


This theorem applies in particular to a homomorphism ¢: G > H 
of discrete groups G and H (i.e., group objects in € = Sets). It yields 
a geometric morphism ¢: BG — BH between the topoi of right G-sets 
and right H-sets. The latter result can be generalized in another way, 
by considering topological groups. Let G and H be topological groups, 
and let ¢: G —» H be a continuous homomorphism. BG and BH then 
denote the categories of continuous right G-sets and right H-sets, as in 
§I.1 and §III.9. One may still use ¢ to define a functor 


¢": BH + BG 


exactly as before: for a continuous H-set Y, ¢*(Y) is the same set Y 
with the induced G-action 


Y x64. 7 <p Sv 


which is again continuous. This functor ¢* clearly preserves finite limits 
because finite limits in BG are computed as finite limits of the underlying 
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sets (more precisely, the forgetful functor BG — Sets creates finite 
limits). Defining the left and right adjoints of ¢*: BH — BG creates 
some problems, however, since simply adopting the above construction 
for discrete groups doesn’t provide us with continuous H-sets. This 
problem can easily be circumvented in the case of the right adjoint ¢,, 
in the following way. Let G® and H® be the groups G and H equipped 
with the discrete topology. Then the inclusion 


BG —— BG? 


of continuous G-sets into arbitrary G-sets is full and faithful. On the 
other hand, for any right action 6: X x G —- X of the (discrete) group 
G on a set X, each point x € X determines its isotropy subgroup 


I,={g€G|ag=r} CG; 


moreover, as in §ITI.9(2), the given action 8 on X is continuous iff every 
I, is open in G; equivalently, for any two points 2; and x2 of X, the set 
of all g € G with 21g = x2 is open. Thus if one considers the subset 


p(X) ={x2 eX | I, is open}, (13) 


then the action of G on X restricts to a continuous action on the subset 
p(X). Hence (13) defines a right adjoint p: BG’ — BG to the inclu- 
sion BG — BG? above. Since this inclusion is readily seen to be left 
exact (pullbacks are preserved by inclusion) this adjoint pair is itself a 
geometric morphism 

pc: BG’ > BG (14) 


with direct image (pg). = p and inverse image pg* the inclusion. 

It follows that ¢*: BH — BG has a right adjoint: write ¢°: G® > 
H® for the corresponding homomorphism of discrete groups. Then we 
have functors 


BG —* BH 


| 00 vn] (15) 


5 *)" 5 
BG’ ———; BH’, 
(6°). 


and clearly (pq@)*og@* = (¢°)*o(px)*. We define the desired right adjoint 
gs as the composite 


ps = (px) «(¢°)«(pa)*. 
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Then ¢, is indeed right adjoint to ¢*, since for a continuous G-set X 
and a continuous H-set Y we have 


Homy(Y, (ox) +(¢°)«(eG)*(X)) & Homye(on*(Y), ($)«pa*(X)) 
& Homgs ((¢°)*px*(Y), pe*(X) 
= Homgs (pg*¢"(Y), pa" (X)) 
~ Home (¢*Y, X), 


— 


the last isomorphism because the inclusion pg* is full and faithful. This 
shows that the continuous homomorphism ¢: G — H of topological 
groups gives rise to a geometric morphism 


¢: BG + BH (16) 


with ¢* and ¢, as just defined. 

Unlike the case of discrete groups, however, this geometric morphism 
need not be essential. In fact, the geometric morphism pg: BG’ + BG 
described in (14) above, which comes from the continuous homomor- 
phism pg: G> — G given by the identity function, need not be essential 
(Exercise 7). 


4. Embeddings and Surjections 


A geometric morphism f: F — € is said to be a surjection when its 
inverse image functor f* is faithful; f is said to be an embedding when 
the direct image functor f, is full and faithful (or equivalently, as for any 
adjunction, when the counit e: f*f, —> 1 is an isomorphism; cf. [CWM, 
p. 88]). A typical example of a surjection is the geometric morphism 
p: F — Fg constructed from a left exact comonad G on F, as in (6) of 
§1. Here p* is the forgetful functor, evidently faithful. A typical example 
of an embedding is the geometric morphism i: Sh; € — € for a Lawvere- 
Tierney topology on a topos €, as in (5) of §1. Here i, is the inclusion 
of j-sheaves into €, clearly full and faithful since a morphism between 
j-sheaves is by definition just a morphism between the corresponding 
objects of €. 

In this section, we will prove that, up to equivalence of topoi, every 
embedding is of the form Sh; € — € and every surjection is of the form 
F — Fg. We will also show that an arbitrary geometric morphism can 
be factored as a surjection followed by an embedding, in an essentially 
unique way. 

Here are some examples of surjections. If f: X > Y is a continuous 
function between T; topological spaces, then f is a surjection (of spaces) 
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iff the corresponding geometric morphism f: Sh(X) — Sh(Y) is. For 
consider the commutative diagram 


sn(y) —2— sh(x) 


[| ; 


OW) a> O(%), 


where the vertical inclusions come from the identification of open sets 
of Y with subobjects of the sheaf 1 € Sh(Y), and similarly for X. Now 
suppose that a point y € Y is not in the image of f. Since Y is assumed 
to be 7}, this point y is a closed point, so Y — y is open and f~'(Y ~y) = 
f—l(Y). Therefore, Y ~y = Y since f~? is faithful, a contradiction. The 
converse proof does not need the T)-condition. For if f is onto, consider 
maps a, 3: E — F of sheaves on Y such that f*a = f*§. Then for any 
point z € X we may take the stalk at x and conclude that (f*a), = 
(f*)e: (f*E)2 — (f*F)z. But (f*E)2 = Eye and (f*a)2 = (2), ete. 
So if f is onto then ay = By: E, — F, for any point y € Y, soa = £. 
(For a slightly different argument, see Proposition IX.5.5.) 

For a set, S, a sheaf on the space S (with the discrete topology) is 
the same thing as an S-indexed family of sets, or (cf. §1.1) as a function 
E — § of sets, ie., an object of Sets/S. So, as a special case of what 
we have just shown, a surjection of sets f: S — T induces a surjective 
geometric morphism Sets/S — Sets/T (because f may be viewed as a 
surjective map of discrete topological spaces). 

More generally, any morphism k: B — A in a topos € induces a 
geometric morphism k: E/B — &/A, as in (4) of §1. The inverse image 
functor k*: €/A — €/B is given by pullback along k, and we will now 
verify that this pullback functor is faithful if k is an epimorphism in €. 
Indeed, if EH — A and F — A are objects in €/A and f: E > Fisa 
map between them in €/A, then one can construct the pullback squares 


Bx,E—@3E 


ipens| | 


Bx,zF—2=+F 


| | 


Be A 


and the projections 72 and 7 are epi since k is, by Proposition IV.7.3. 
Thus, if g: E — F is another arrow in €/A such that k*(f) = k*(q), 
then fai = mak*(f) = mk*(g) = gm, 80 f = 9 since 74 is epi. This 
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shows that k*: €/A — €&/B is faithful if k is epi, as asserted. Thus, 
an epimorphism k: B — A induces a surjective geometric morphism 
E/B > E/A. 

There is a similar sequence of examples of embeddings. Consider 
first the injection 1: Y — X of a subspace Y of a topological space X, 
Then for a sheaf FE on the subspace Y the counit of the adjunction 
ix: Sh(Y) @ Sh(X) :7* can be calculated on the stalk of a point y € Y 
from the definition IT.5(7) of the stalk in terms of germs as follows: 


[i*ix(E) ly = [ix(E)] icy) 


a 


Ee i,(E)(U) (U open in X) 
= lim, E(i1U) (definition of i,) 
Slim = E(V) (V open in Y) 


where the second-but-last isomorphism comes form the fact that open 
sets V in the subspace Y are all of the form V =UNY =77'(U) for 
some U open in X. Thus, i*i, & 1, so i is an embedding. 

As before, the special case where X and Y are spaces with the dis- 
crete topology shows that an injection m: S —T between sets gives an 
embedding of topoi Sets/S —> Sets/T. 

More generally, we claim that a monomorphism k: B — A in a topos 
E gives an embedding k: €/B — E€/A of slice categories. To see this, 
recall first that there are adjoint functors 


€/B ——E/A, De 1k* AT = ky, 


as in Theorem IV.7.2. For an object E — B of €/B, the composition 
k*De(f: E —- B) is computed by first composing with k and then pulling 
back along k; in other words k*,,(E — B) is the left-hand arrow in the 
pullback square on the left below: 


e———— EF E—1 6B 
| f f 
B B— +B 
cE | E 
BoA BA. 


The top square on the right is always a pullback, but if & is mono, 
then the bottom square on the right also is a pullback, so clearly in 
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this case k*D,(E — B) = (E — B); that is, the unit 7: 1 > k*Dz 
is an isomorphism. By the following lemma, one sees that the counit 
k*k, — 1 is also an isomorphism. This proves that k: €/B —- €/A is an 
embedding if k: B + A is mono. 


Lemma 1. Let 


bid* ids: FE 


be adjoint functors. Then the unit 1 >» $*¢1 of the first adjunction is an 
isomorphism iff the counit ¢*¢, — 1 of the other adjunction is. Hence, 
os is full and faithful iff dy is. 


Proof: For any objects D and F of €, an arrow h: ¢\D — ¢,£ has 
a transpose under the first adjunction (premultiply ¢*h by the unit 7) 
and also under the second adjunction (postmultiply by the counit €). 
Using both operations in either order gives a commutative square 


E(D, E) —2— €(D, $*b.E) 


ao| | 


E(¢*d1D, E) === F(prD, 6B), 


a 


where the isomorphisms come from the adjunctions. It follows that (ez), 
is an isomorphism for all D and F in € iff (#jp)* is. By the Yoneda 
lemma, we conclude that eg is an isomorphism for each object FE iff 7p 
is an isomorphism for each object D. This proves the first assertion of 
the lemma. Now for any adjunction, the right adjoint functor is full and 
faithful iff the counit is an isomorphism ((CWM, p. 88]). Dually, the 
left adjoint is full and faithful iff the unit is an isomorphism. Applying 
this to ¢* 4 d, and to ¢ 4 ¢* respectively yields the other assertion of 
the lemma. 

If j is a Lawvere—Tierney topology on a topos € and i: Sh; E€ > € 
is the corresponding embedding as above, then we say that a geometric 
morphism f: F — € factors through Sh; € (or through i), when there 
exists a geometric morphism g: F — Sh, € such that the diagram 


F—+uE 

o |: (2) 
WS 
Sh, E 


commutes up to natural isomorphism; that is, g*i* & f*, or equivalently 
(by the uniqueness of adjoints, up to isomorphism, [CWM, p. 83]) that 
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is t4g. = fx. Notice that such a geometric morphism g is unique (again 
up to natural isomorphism) if it exists since 7, is full and faithful. 

Now recall from §V.1 that a topology j on € may equivalently be 
given in terms of a closure operator Sub(E) — Sub(E), A+ A, for all 
AC E and natural in E; moreover, recall that AC F is called dense in 
E when A = E. 


Proposition 2. Let f: # — € be a geometric morphism while j is 
a topology on the codomain €. Then the following are equivalent: 


(i) f factors through Sh, €; 
(ii) the direct image f, sends all objects of F to j-sheaves in E; 
(iii) the inverse image f{* maps dense inclusions of subobjects in E to 
isomorphisms in F. 


Proof: (i)=(ii) If f factors through Sh; € by g in (2), then f, ~ 
t49x, as stated above. Thus for any object F of F, f,F is in the image 
of i,, ie., is a j-sheaf. 

(ii)<>(iii) For a monomorphism u: A E in € and an object F of 
F, consider the commutative diagram expressing naturality 


E(E, fF) ——— F(f*E, F) 
ccoth| [roan 


E(A, fF) ———> F(f* A, F), 

with horizontal isomorphisms given by the adjunction. Now if f,F isa 
sheaf for all F in F and u is dense, then the induced map u* = E(u, f.F) 
is an isomorphism for all F in F [by the definition of a sheaf, V.2(2)]; 
hence so is the induced map F(f*u, F) for all F. By the Yoneda lemma, 
it follows that f*u is an isomorphism. And if f*u is an isomorphism for 
all dense u, then by the diagram again so is the induced map €(u, f.F) 
for all such u. But, by the definition V.2(2) again, this means exactly 
that f,F is a sheaf. 

(ii)=>(i) If f,F is a sheaf for all F, then f,F is isomorphic to its 
sheafification, so that the unit map f, — i,7*f, is a natural isomor- 
phism. Hence if we define a functor g.: F — Sh;€ by gx = i*fe, 
then f, © i,g,. A functor in the opposite direction is then defined by 
g* = f*is, clearly left exact. It is also left adjoint to g,, because, for all 
FeéF and E€ Sh; €, 


F(g*E, F) =F(f*i.E, F) (by definition of g*) 
~ E(1,E, fF) (since f* 4 f,) 
= E(t, Ei." f,F) (since f,F is a sheaf) 
~ Shy E(E,i*f.F) ( 
~ShyE(E,g.F)  ( 


since 7, is full and faithful) 
by definition of 9.). 
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Thus g* 4 9. defines a geometric morphism g with ig & f, as required. 


We now state several equivalent descriptions of surjections. 


Lemma 3. For a geometric morphism f: F — E the following are 
equivalent: 


(i) f is a surjection; i.e., f* is faithful; 

(ii) each unit E + f,f*E, (E € €), of the adjunction is mono; 

(iii) f* reflects isomorphisms; 

(iv) for each object E in €, f* induces an injective homomorphism of 
subobject lattices Sub(&) — Sub(f*E); 

(v) f* reflects the order on subobjects, in the sense that for any two 
subobjects A, B of an object E in €, 


A <B in Sub(E) iff f*A < f*B in Sub(f*E). 


Proof: (i)<(ii) This equivalence is a general property of adjoint 
functors, see [(CWM, p. 88]. 

(i)=>(iv) Let A C E be a subobject with corresponding classifying 
map x4: E — OQ¢. By applying the left exact functor f* we get the 
following pullback in F: 


f*A———  f*1 


|r (true) 


f Ea’ f (Qe). 


If wu: BE is another subobject of E with A < B, such that f*B = 
f*A, then f*(xa)of*(u) = f*(true)o! s«(p), where!;«p: f*B 12 f*1 
is the unique map. Since f* is faithful, also y4 0 u = (true)o!z; that is, 
B<Aand hence B = A. 

(iv)> (iii) Let a: E — E" be an arrow in € such that f*a is an 
isomorphism and let Im(a) € E’ be its image in E’. Since f* is left and 
right exact, it preserves monos and epis and so takes the factorization 
of @ into that of fta: 


f* E —» f*(Ima)-——> f*E’. 


But, by assumption, f*a is iso, hence epi, so f*(Ima) = f*(E’). By 
the assumption (iv) this gives Ima = E’, so a is epi. To see that a is 
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also mono, consider the kernel pair K C E x E of & obtained by pulling 
back @ along a, as in 


Kk —— E 


E a E’, 
Write Ag for the image of the diagonal map, so that An C K C Ex E. 
Now the left exact functor f* preserves pullbacks, so f*K is the kernel 
pair of f*a. Since f*a is mono, f*K = Ay-eg = f*(Az). Thus, by (iv) 
again, K = Ag; in other words, @ is mono. 

(iii)=(i) Consider two parallel arrows a, @: E — E’ in € with f*a = 
f*@, and form their equalizer u: A E= E’. Now f* is left exact, 
hence preserves equalizers, so f*u is the equalizer of f*a = f*@, and thus 
is an isomorphism. By the assumption that f* reflects isomorphisms, u 
must also be an isomorphism. In other words, a = £. 

(v)=(iv) If A, B € Sub(£) and f*(A) = f*(B), then by (v) both 
A<BandB<A,soA=B. 

(iv)=>(v) Clearly, if A < B in Sub(£), then f*(A) < f*(B) in 
Sub(f*E), since f is a functor. Conversely, if f*(A) < f*(B), then 
f*(A) = f(A) A f*(B) = f*(AA B), the latter equality by the left 
exactness of f*. Then (iv) yields A = AAB and hence A < B. 


Proposition 4. A geometric morphism f: F — € is a surjection iff 
there exists a left exact comonad (G,¢,6) on F and an equivalence of 
categories e such that the diagram 

f 


FE 


SI 


commutes up to isomorphism, where p is the canonical surjection to the 
category Fg of G-coalgebras, as in (6) of §1. 


Proof: Since p is a surjection for any comonad G, the “only if” 
assertion is clear. Conversely, the given geometric morphism f is a pair 
of adjoint functors f* 4 f,, so induces a comonad (G,¢€,6) on F, with 
G = f*f, (as in the beginning of §V.8). Since f* is left exact, so is 
this comonad G. Now f* reflects isomorphisms by Lemma 3, while 
both F and €, as topoi, have coequalizers. Therefore € is equivalent 
to the category Fg of coalgebras for G (and under the equivalence f* 
corresponds to the forgetful functor of coalgebras), according to the 
dual of Beck’s weak tripleability theorem, stated here explicitly for the 
convenience of the reader. 
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Lemma 5. Let L: C + D and R: D > C be functors, with L left 
adjoint to R with unit n. Let (G,6,e) be the induced comonad on D 
with G = LR, and let K: C + Dg be the comparison functor sending an 
object C of C to the coalgebra K(C) = (LC, Lnc: LC > LRLC). IfC 
has (reflexive) equalizers, and L preserves (these) equalizers and reflects 
isomorphisms, then K is an equivalence of categories. 


Note: This result holds without the qualification “reflexive”, or 
with “reflexive equalizers” at both points; here a reflexive equalizer is 
an equalizer of a reflexive pair. 


Proof: This lemma is simply the dual of Corollary IV.4.3. 


The next two theorems will show that every geometric morphism 
can be factored as a surjection followed by an embedding, and this in 
an essentially unique way. 


Theorem 6 (Factorization Theorem; existence). Let f: F > 
E be a geometric morphism. Then there exists a topology j on € for 
which f factors through the embedding i: Sh; € — E by a surjection p: 


f 


F ——— E 


Proof: The given f determines a topology j on € by way of the 
following natural closure operator (—) on €. For an object E € € and 
a subobject U C E, define U as the pullback in the diagram 


U ——> f. f*U 


i > 


Ep ies £; 


where 7 is the unit of the adjunction f* 4 f.. This indeed defines a 
closure operator. First, U € U holds by the diagram expressing the 
naturality of 7. Moreover, since f,f* preserves pullbacks and pullbacks 
preserve intersections, U 7 V = UNV for two subobjects U_and V of 
E. Finally, (4) below shows for any subobject of U that f*(U) C f*U 
and hence that f*(U) C f*(U) C f*(U). Then (4) again gives the 
last condition U ¢ U for a closure operator. The closure operator thus 
defined is clearly natural in E. Therefore it corresponds to a unique 
topology j on €, with corresponding embedding i: Sh, € — €. 
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Next, notice that the adjunction f* 4 f, and the universal property 

of the pullback square (3) imply that for any two subobjects U and V 
of E, 

V CU iff f*V C f*U. (4) 


Indeed, subobjects v: V— E and u: UE give a commutative dia- 
gram 


y —_———_"—______+ p. fv 


(5) 


E — TE” [SE 


with inner square a pullback as in (3). If f*V C f*U, then also f, f*V C 
f«f*U, so there exists a dotted arrow making the right-hand triangle 
commute. By the universal property of the pullback, it follows that there 
exists a dotted arrow V — U making the left-hand triangle commute; 
ie., V < U. Conversely, given an arrow V — U such that the left- 
hand triangle commutes, we get by composition with U > f,f*U an 
arrow pw: V > f,f*U such that f, f*(u) op = nz ov. Transposing this 
identity along the adjunction E(V, f, f*E) = F(f*V, f* E), we find that 
f*(u)ofi = f*(v), where ji: f*V — f*U is the transpose of uy and f*(v) 
is mono. Thus i is mono and therefore f*(V) < f*(U). 

To show that f: F — € factors through the inclusion i: Sh; € > € 
for this topology j, it suffices, by Proposition 2, to check that f* sends 
j-dense subobjects U — E in E to isomorphisms in F. But if U C E is 
dense, then E = U; so by (4) above (with V = E) we get f*E = f*U, 
ie., f*(U — E) is an isomorphism. Thus f indeed factors through i, say 
as f =iop: 

f 


F—+WE 


c | (6) 


It remains to be shown that the factor p is a surjection. To see that 
condition (iv) of Lemma 3 is satisfied, take subobjects U C V C E in 
Sh, € and suppose that p*U © p*V. Then f*i,U = p*i*i.U = p*U = 
pV = ftisV, so i,U = 1,V by (4), since subobjects in the image of 
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t, are closed (cf. Lemma V.2.4). But i, is just the inclusion functor, so 
U = V. This shows that p is a surjection, and completes the proof of 
the theorem. 


Next we state an analog of Proposition 4 and parts of Lemma 3. 


Corollary 7. For a geometric morphism f: F — € the following 
are equivalent: 


(i) f is an embedding (i.e., the direct image functor f, is full and 
faithful); 
(ii) the counit er: f* f,.F — F is an isomorphism, for each object F 
of F; 
(iii) there is a topology j on € and an equivalence e: F = Sh, € such 
that the diagram of geometric morphisms 


F f 


~~ | (7) 


Sh, € 
commutes up to a natural isomorphism e*i* = f*. 


Proof: The equivalence (i)<>(ii) is a general fact about adjoint func- 
tors, see [CWM, p. 88]. 

(iii)=>(i) Suppose (7) is given, so f = ioe. Now i, is full and faithful 
and e, is an equivalence of categories, so i,e, is full and faithful, and 
therefore by the isomorphism i.e, = f., so is fx. 

(i)=>(iii) By the theorem, f factors as in (6), where i is an embedding 
and p is a surjection. In particular, i, is full and faithful, as is f, by 
assumption. Hence so is p,. Therefore, by the equivalence (i)+(ii) 
applied to p, the counit er: p*p,F — F is an isomorphism for each F' 
in F. By the triangle identity 


pE P"NE p*p.p*E 


pE 


we now see that p*7z must be an isomorphism. By Part (iii) of Lemma 3 
it follows by the surjectivity of p that nz is an isomorphism, for each 
E € €. Thus, since each counit er and each unit ng is an isomorphism, 
p is an equivalence. 
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Theorem 8 (Factorization Theorem; uniqueness). 
p 

Let f: F — E€ be a geometric morphism, while F —> A —, € and 

q 
F + B —» € are two factorizations of f, in the sense that the relevant 
triangles formed with f commute up to natural isomorphisms. If v is 
an embedding and p is a surjection, then there is a geometric morphism 
g, unique up to natural isomorphism, such that both triangles in the 
diagram 


A 


ae a 
F 
ne 


é (8) 


=) 


oO < 


commute up to natural isomorphism. If also u is an embedding and q is 
a surjection, then g is an equivalence. 


Proof: By Corollary 7, we can assume that the embedding v is the 
inclusion i: Sh;€ — E€ for some topology j on €. If UE is a j- 
dense monomorphism in €, then, as above just before (6), v*U — v*E 
is an isomorphism, and hence so is the left-hand vertical map in the 
commutative square 


qru*U pu*U 
qu" E ——— p*u*E. 


Since p is a surjection, p* reflects isomorphisms (Lemma 3), and hence 
u*U — u*E must be an isomorphism. By Proposition 2 [(iii)=>(ii)], it 
follows that u factors through v, so there is a g: A > B with vg = u 
(hence v.g. = us). This g is unique up to isomorphism [as stated below 
(2)]. Moreover, U1gsDx & UsPs = Vege, 80, Since v, is full and faithful, 
we conclude that g,p. = q.; in other words, the left-hand triangle in (8) 
also commutes up to isomorphism. 

Now suppose that u is an embedding and gq is also a surjection. Then 
we find a similar factorization h: B + A. By the uniqueness part proved 
(for g), both goh and hog must be isomorphic to the identity, so each 
is an equivalence, and the theorem is proved. 

Sheaves on topological spaces provide an immediate example of this 
surjection-embedding factorization for geometric morphisms. Indeed, 
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for any continuous map f: X — Y between spaces, we can equip the 
image f(X) with the subspace topology from Y, so as to factor f as 


X —24 f(X) +, 


where p is a surjection and i is the inclusion of the subspace, both 
continuous. This gives a corresponding factorization of the geometric 
morphism f: Sh(X) — Sh(Y) as 


Sh(X) —2-> Sh( f(X)) —*— Sh(Y), 


and, as observed in the beginning of this section, p is a surjective geo- 
metric morphism while 7 is an embedding. 

Similarly, if k: B — A is an arrow in an elementary topos €, one 
may factor & as an epimorphism followed by a monomorphism, as in 
SIV.6: 


For the corresponding slice categories, this gives a factorization of the 
geometric morphism k: €/B > E/Aas E/B ae E/f(B) aie €/A, where 
the first morphism is a surjection and the second is an embedding, as 
proven at the start of this section. 

As another example, consider a functor ¢: C — D between small 
categories C and D, and the induced essential geometric morphism 


¢: SetsC” ——+Sets?”, 1 4¢6*4d« 


of presheaf topoi, as described in §2. It readily follows that ¢* is faithful 
if every object of D is isomorphic to an object in the image of ¢. We 
claim that ¢, is full and faithful iff the original functor 6: C — D is full 
and faithful. Indeed, if ¢ is full and faithful then for each object C' in C 
and each presheaf P in Sets©’”, 


$*41(P)(C) = o(P)(6C) (definition of $*) 
= Pc D(dC, ¢-) (definition of ¢1) 
= P@cC(C, -) (since ¢ is full and faithful) 
= P(C), 


the latter isomorphism since as in §2.(22) the Hom-functor is the identity 
for the tensor product ®c. This isomorphism 1 ~ ¢*¢; is the unit, so 
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by Lemma 1 above it follows that ¢, is full and faithful. Conversely, 
suppose @¢, is full and faithful, and consider the diagram 


It easily follows from the tensor-product definition of ¢ that this dia- 
gram commutes (up to natural isomorphism). But ¢, is full and faithful 
since ¢. is (by Lemma 1), as are both Yoneda embeddings. So ¢ must 
be full and faithful as well. 

Now if »: B — D is any functor between small categories B and D, 
one may take C = ~(B) to be that full subcategory of D whose objects 


B @ 
are all the 7(C), C € C. Then w factors as B — C — D with ¢ full 
and faithful, and @ surjective on objects. This yields a factorization 


Sets?” —° , SetsC” —2 . SetsP” (9) 


of the geometric morphism 7: SetsP”” — SetsP””. We have just shown 
that since ¢ is full and faithful, so is ¢.; ie., d: Sets©” — SetsP”” 
is an embedding. Also, by an earlier remark, the geometric morphism 
6 in (9) is a surjection since G: B — C is surjective on objects. Thus 
(9) is the surjection-embedding factorization of the geometric morphism 
w: Sets?” — SetsP’” induced by %: B > D. 


5. Points 


A point of a topos € is a geometric morphism p: Sets — €. In 
particular, a point x of a topological space X can be considered as a 
(continuous) map from the one-point topological space 1 into X; as 
in §1 this gives a geometric morphism x: Sets = Sh(1) — Sh(X), in 
other words, a “point” of the topos Sh(X). Explicitly, according to the 
description of f* in §1, for a sheaf F on X the inverse image x*(F) is 
exactly the stalk of F at x (the pullback of F along x: 1 — X). In 
the opposite direction, the sheaves in Sh(1) are just the sets S, and the 
description of the direct image functor x, in §1 shows for each open set 
U of X that 


z.(S)\(U)=SifxeU; «,(S)(U)=life ¢U. (1) 


This sheaf x,(S) on X is the “skyscraper” sheaf at x [Lemma II.6(11)]. 
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On the other hand, there are topoi which do not have any points 
at all; for instance, the topos of sheaves on a complete Boolean algebra 
[as a special case of §III.2, Example (d)| is “pointless” if the Boolean 
algebra is atomless (Exercise 9). 

The points of a Grothendieck topos € will be analyzed in this section, 
beginning with the case of a presheaf topos on a small category C. 

Let f: Sets > Sets©” be a point of Sets©”. Since every object 
of Sets©”” is a colimit of representables, and since the inverse image 
functor f*: Sets” — Sets, as a left adjoint, must commute with 
colimits, this functor f* is completely determined up to isomorphism 
by what it does to representables. Or, in other words, f* is determined 
up to isomorphism by its composite f* oy: C — Sets with the Yoneda 
embedding. Hence we will aim to describe points of Sets©’”—that is, 
geometric morphisms f: Sets > Sets©””—purely in terms of suitable 
(covariant) functors A: C —> Sets. 

By the Hom-tensor adjunction (15) of §2, each such functor A: C > 
Sets yields a pair of adjoint functors Sets 2 Sets© ” , given for a set S$ 
and a presheaf R as 


S++ Hom, (A, S): Sets > eae (2) 


Re ROA: Sets©” — Sets. 
Here, as in §2, Hom,(A, —) is the presheaf defined for each set S by 
Hom, (A, 5)(C) = Hom(A(C), $); 


it is right adjoint to the tensor product — ®c A, as in (15) of §2. This 
tensor product, as there, is the set of pairs r ®a of elements r € R(C), 
a € A(C), with the identifications rg @ a’ = r ® ga’ for each g: C' > C 
and a’ € A(C’). Equivalently the tensor product was described as the 
coequalizer §2(13) of two maps into a coproduct [|¢ R(C) x A(C). If we 
regard the set-valued functor R as a set which is “indexed” by C' € Co 
and thus as a set over Co, this coproduct can be written as a pullback 
R xc, A. The coequalizer then appears in the form 


Rxo, XCi xc, AS RX, A—— ROCA. (3) 


Since the resulting functor — @c A is a left adjoint, it commutes with 
colimits. Also the representable functors R = C(~—,C) = yC act (like 
the ground ring in the tensor product of modules) as left identities, in 
view of the canonical natural isomorphism [cf. §2(20) for the bifunctor 
description] 

YC @cA=C(—,C)®@c AX A(C). (4) 
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given for g: C’ > C and a € A(C”) by 
gSangacA(C). (5) 


In other words, the diagram of functors 


commutes up to natural isomorphism. This diagram states that the 
functor — ®c A is an extension of A along the embedding y. [In fact, 
Exercise 3, it is the (left) Kan extension of A along y; for the definition 
of Kan extensions, see [CWM, p. 236].| 

In the special case where the functor A comes from a point f of 
Sets©”” as the composite 


A=(C—) setsc” ©, Sets), (7) 

there is for any presheaf R a canonical map of sets, 
er: R@cA— f*(R), (8) 
natural in R. To define this map for an object C of C, and elements 
a € A(C) and r € R(C), use the Yoneda lemma to regard r as a natural 


transformation + = C(—,C’) — R and so as a morphism of presheaves. 
Now set 


er(r @a) = f*(F)(a), (9) 
as displayed in the diagram, describing the image of r @a, 
c—__4——-. Sets C(-,c) Fy(C)=A(C) a 
X vy, TR I _ ro | 
Sets” R f*(R) F*(F)(a) 


To justify this definition of the map ex by the definition of the tensor 
product, we must consider g: C’ > C and a’ € A(C’) and show that 


er(Tg@a’) = er(r @ga’). (10) 
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Indeed, this follows from the following computation, 


f(r -9) a’) = fF F oy(g))(0’) 
= f* (*)f* (y(g))(@’), f* is a functor, 
= f*(*A(g)(@’), Py =A, 
= f*(F)(g- a’). 


In case R = C(— ,C) use (7) to show that the definition (9) reduces to 
the isomorphism (4), so that in this case eg as in (8) is an isomorphism. 
But in (8), both functors — @A and f* commute with colimits, and 
every presheaf is a colimit of representables. It thus follows that er is 
an isomorphism for each presheaf R in SetsC”’. So, up to isomorphism, 
the inverse image part f* of every point f: Sets > Sets© ” is given by 
tensoring with some functor A: C — Sets. 

Conversely, given such a functor A: C — Sets, the induced functor 
— @c A: Sets©” — Sets always has a right adjoint Hom, (A, ~), as 
in (2). So A is a good candidate to define a geometric morphism 


g: Sets — Sets” 


i.e., a point of Sets©” by 
9 (R)=R®cA, — gx(S) = Home (A, $), (11) 


for any presheaf R € Sets©’” and any set S. However, the functor g* 
thus defined is not necessarily left exact. 

Now recall that for abelian groups # and A (or for modules over a 
ring), the corresponding tensor product R +> R®A is not necessarily 
left exact because a short exact sequence 0 —- R — S > T — 0 isn’t 
necessarily turned into a short exact sequence 0 —- R@A— S@A- 
T®A — 0. For exactness, replace the left-hand zero by the so-called 
torsion product Tor(T, A); see [Mac Lane 1963, p. 151]. The mod- 
ule A is called flat when — ®A does turn short exact sequences into 
short exact sequences. The corresponding convention in the context of 
presheaves is: 


Definition 1. A set-valued functor A: C — Sets is said to be flat 
when the induced tensor product functor — @c A is (left) exact. 


Notice that — @c A is always right exact since it has a right adjoint. 
With this definition, we can summarize the preceding discussion by the 
following statement: 


Theorem 2. Points of the presheaf topos Sets©” correspond to 
flat functors A: C — Sets. 
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More precisely, let Flat(C) be the category of flat functors C > 
Sets and natural transformations between them; furthermore, as in 
§1, write Hom(Sets, Sets©”) for the category of geometric morphisms 
Sets — Sets©” and natural transformations (between their inverse 
image parts). Then Theorem 2 can be formulated more explicitly as: 


Theorem 2 [bis]. There is an equivalence of categories 
Flat(C) == Hom(Sets, Sets©” ) 


with the functors 7 and p defined, for flat functors A: C — Sets and 
points f: Sets — Sets”, by 


7(A)* = — @c A, 7(A)« = Hom(A, —), 
p(f)=f*oy: Co Sets©” — Sets. 


Proof: Clearly 7(A) is functorial in A and p(f) is functorial in f. 
Furthermore, for each object C’ of C there is, by (4) above, an isomor- 
phism pt(A)(C) = y(C)®@cA & A(C) which is natural in A and C. 
Thus pr(A) = A, natural in A. In the other direction, for a geometric 
morphism f, there is an isomorphism — @c p(f) & f* as in (8) above, 
readily seen to be natural in f. Thus, pr = id and rp & id, so 7 and p 
do form an equivalence of categories. 


We next study points of sheaf categories. First consider a 
Grothendieck topology J on C and the corresponding embedding 


Sh(C, J) —*— Sets©”. 


This geometric morphism is given by the inclusion functor 
i,: Sh(C,J)— Sets” taking sheaves into presheaves (by definition 
ix is a full and faithful functor), and the associated sheaf functor 
i*: Sets©” — Sh(C,J). Since i, is full and faithful, the category 
Hom(Sets, Sh(C, J)) of points of the topos Sh(C, J) is equivalent to 
the full subcategory of Hom(Sets, Sets© ’) consisting of those geomet- 
ric morphisms which factor through the embedding 7. This leads to: 


Lemma 3. For a Grothendieck topology J on C and a point 
f: Sets > Sets©”” the following are equivalent: 


(i) f factors through the embedding i: Sh(C, J) Sets”; 
(ii) the composite f* oy: C — Sets sends each covering sieve in C 
to a colimit diagram in Sets; 
(iii) f* oy: C — Sets sends each covering sieve to an epimorphic 
family of functions. 
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The meaning of the last two statements may need some clarification. 
Recall that a sieve S on an object C’ of C can equivalently be regarded 
as a subpresheaf S C y(C) or as a family S of arrows u: D — C, all 
with the same codomain C’, and closed under composition on the right 
{iiec., u € S implies u ov € S whenever the composition makes sense). 
The Yoneda embedding y: C > Sets©”” sends such a family of arrows 
S to a cocone {y(u): y(D) > y(C) | u € S} on y(C) in the category 
Sets”. Notice that this cocone is the canonical representation of the 
presheaf S C y(C) as a colimit of representables (Proposition [.5.1): 

Le Baas” ©) =o y(C). (12) 
Part (ii) of the above lemma for a covering sieve states that f* sends this 
cocone on y(C’) to a colimit in Sets, and Part (iii) states that the family 
of functions { f*y(u): f*y(D) > f*y(C) | u € S} is jointly surjective. 

Proof of Lemma: (i)=>(ii) Let S be a covering sieve on C’ for the 
topology J, so that S — y(C) is a dense subpresheaf. By condition (iii) 
of Proposition 4.2, f*(S)— f*(yC) is then an isomorphism. But /f* 
preserves colimits, so by (12) we get an isomorphism 

lim pues y(D) —— fry(C); 
thus condition (ii) is satisfied. 

The next implication (ii)=> (iii) is clear from the definition of a co- 
limit. 

(iii)>(i) By Proposition 4.2, Part (iii) again, it suffices to show that 
f* sends dense inclusions of subpresheaves to isomorphisms. So let B € 
P be dense, and write the presheaf P as a colimit of representables 
(Proposition I[.5.1), say as P = lim y (Ci). For each index i, define the 
presheaf B, by pullback as in 


B-————_ P 


a. 


By—— y(C.). 
Then, since pullbacks preserve colimits (Theorem IV.7.2), B = lim Bj. 
Moreover B;— y(C;) is dense. We may regard B, as a subfunctor of 
y(C;), ie., as a sieve on C,; then density means exactly that this sieve 
B, is a cover for the topology J. Now consider for each arrow u: D > C, 
in B, the diagram 
F*(Bi)— Fr (¥(G,)) 


| A4om 


f*y(D). 
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Since the lower arrows f*y(u), for all wu € B;, form a surjective fam- 
ily [by assumption (iii), it follows that f*(B;)— f*(y(C,)) must also 
be a surjection of sets, hence an isomorphism. But f* preserves colim- 
its, so if f*(B;)— f*(y(C;{)) is an isomorphism for each i, then so is 
f*(B)— f*(P) since B = lim B; and P = limy(C;). This proves the 
lemma. 

A functor A: C — Sets is called continuous (for a Grothendieck 
topology J on C) if A sends covering sieves to colimit diagrams. If A 
is also flat, then it follows from Theorem 2 [bis] that up to isomorphism 
A is of the form f* oy: C = Sets©” for some geometric morphism 
f: Sets = Sets©”. Thus, by Lemma 3, a flat functor A is continuous 
iff it sends covering sieves to epimorphic families. If we write 


ConFlat(C) 


for the full subcategory of Flat(C) given by the continuous flat functors, 
then Theorem 2 and Lemma 3 immediately yield the following result: 


Corollary 4. Let (C,J) be a site. Points of the topos Sh(C, J) 
correspond to continuous flat functors C — Sets by an equivalence of 
categories 

ConFlat(C) —= Hom(Sets, Sh(C, J)) 
which is given by restricting the equivalence of Theorem 2. 


Proof: Consider the equivalence 
7: Flat(C) = Hom(Sets, Sets©”) ip 

of Theorem 2. The categories ConFlat(C) and Hom(Sets, Sh(C, J)) are 
full subcategories of those occurring in this equivalence. Now if a flat 
functor A: C — Sets is continuous, i.e., if A transforms covering sieves 
into colimits, then so does pr(A) = 7T(A)oy: C > Sets©” — Sets, 
by the isomorphism A = pr(A) of Theorem 2 . Thus, by Lemma 3, the 
geometric morphism 7(A): Sets > Sets©’” factors through the embed- 
ding Sh(C, J) > Sets©”. On the other hand, if a geometric morphism 
f: Sets — Sets©”” factors through Sh(C, J), then again by Lemma 3 
the flat functor p(f) = f* oy: C — Sets is continuous. It follows that 
the equivalence of Theorem 2 [bis] restricts to an equivalence of these 
full subcategories, as stated in the corollary. 


6. Filtering Functors 

Theorem 5.2 and its Corollary 5.4 are not of much use to us without 
a more practical description of flatness directly in terms of the func- 
tor A: C > Sets. The purpose of this section is to arrive at such a 
description. 
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Recall that a small category I is filtering, alternatively filtered, if it 
enjoys the following three properties: 


(i) I is nonempty; 
(ii) for any two objects i and j of I there is a diagram i — k > j in 
I, for some object k; 
(iii) for any two parallel maps i= j in I there exists a commutative 
diagram of the form k — i337 in I. 


In other words, there are objects in I, any two can be “joined”, and any 
two parallel arrows can be equalized. Recall also that limits taken over 
a filtering category are called filtered limits ((CWM, p. 208)). 


Lemma 1. A small category I is filtering iff for any finite diagram 
in I there exists a cone on that diagram. 
In particular, a category with an initial object is always filtering. 


Proof: A cone with vertex k on the diagram wu, v: i =3 j in I consists 
of arrows a: k > i and 8: k >i such that ua = @ and va = f. Next a 
cone with vertex k on the diagram i, j (of two objects and no arrows) 
is given by morphisms a: k > 7 and 8: k — j. Finally, a cone on the 
empty diagram simply consists of an object k—the vertex of the cone. 
Thus, if there is a cone on every finite diagram in I, then I is certainly 
filtering. 

Conversely, suppose I is filtering. We prove that there exists a cone 
on any diagram by induction on the number of (nonidentity) arrows in 
that diagram. If there are no arrows at all, then the diagram consists 
only of a finite set of objects, say i1,...,i,, and repeated application 
of Part (ii) of the definition of “filtering” categories [or Part (i) in case 
n = 0] produces an object & in I with morphisms a4: k — ig, for t = 
1,...,7. 

For the inductive step, suppose we are given a cone consisting of 
morphisms a,;: k > j on a finite diagram in I, and that one new arrow 
u:i— j is to be added to the diagram, as lower left in the figure 


k' 

( 

! 
oI 
t--g->g — jj’ — jj" 


Then perhaps ua; # a3, so this is not a cone. But by Part (iii) of the 
definition there exists an arrow @: k' — k from some object k’ in J with 
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ua, 3 = a3. This provides a new cone with vertex k’ on the enlarged 
diagram. 

The category of elements of a contravariant functor defined in §1.5 
can be formulated also for a covariant functor A: C — Sets. Given 
such a functor A, construct a category ie A of elements of A with as 
objects the pairs (a,C') where a € A(C) and C’ € C, and as arrows from 
one such pair (a,C’) to another (b, D) those arrows u: C — D for which 
A(u)(a) = 6b; that is, with u-a = b. There is an evident projection 
functor 


TA: [anc (1) 


The category of elements {., A is also called the Grothendieck con- 
struction on the functor A (or the diagram of A). For example, if 
A = Hom(C, — ) is a covariant representable functor, then ie A is the 
category of all arrows in C from the object C’, sometimes written as the 
comma category C/C. 

(The “Grothendieck construction” also has a more general form, for 
functors from a small category C not to sets but to the category of all 
small categories; the result is then a fibered category.) 

With this we now define filtering functors: 


Definition 2. A functor A: C — Sets is said to be filtering if its 
category of elements te A is a filtering category. 


Explicitly, this means that a functor A is filtering iff it satisfies the 
following conditions: 


(i) A is nonempty; i.e., A(C) # @ for at least one object C of C. 
(ii) Given elements a € A(C) and b € A(D), there exist an object B, 


morphisms C’ “Bs Din C, and an element c € A(B) such 
that u-c=aand v-c=b. 

(iii) Given two parallel arrows u, v: C — D in C and ana € A(C) 
such that u- a= v-a, there are an arrow w: B — C in C and an 
element 6 € A(B) such that uw = vw and w-b=a. 


The desired explicit description of flatness can now be stated as 
follows: 


Theorem 3. A functor A: C — Sets is filtering iff A is flat; that 
is, iff the functor ~ @c A is left exact. 


Here the “only if” assertion is a generalized form of the categorical 
result that finite limits commute with filtered colimits [CWM, p. 212]. 
On our way to the proof of Theorem 3, let us consider again the 
functor — ®c A: Sets©” — Sets. For a presheaf R € Sets© , the set 
R®c A is by definition [see (3) of §5] the quotient of the set R xq, A 
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by the smallest equivalence relation which identifies (rg, a) and (r, ga) 
for any g: C’ > C in C, r € R(C), and a € A(C’). In the special case 
where A is filtering (as in Definition 2) this equivalence relation can be 
described in a more direct way: given pairs (r,a) € R(C’) x A(C) and 
(r’, a’) € R(C’) x A(C’), one has 


r®a=r'@a' iff there are arrows C «< D —> C’ and some 
b € A(D) such that ub =a, vb =a’, andru=r'v. (2) 


Indeed, if there are u, v, b as in (2), then r@a = r®ub = ru@b = 
r'y®b=r' @a'. Thus it suffices to prove that the right-hand side of (2) 
defines an equivalence relation. It is clearly reflexive and symmetric. To 
see that it is also transitive, suppose we are given (r, a) and (r’, a’) related 
as in the right-hand side of (2), and (r”’, a’) related to (r’, a’) in the same 


way; say r” € R(C”), a” € A(C”), and there are C’ nae en C” and 
e € A(E) such that we = a’, te =a", andr'w=r"t. Now yu, v, w, and t 
can be viewed as arrows in Je A as in (3) below, and by Lemma 1 there 
exists a cone on the diagram formed by these arrows, say with vertex 
the object (f, F) € J, A [ie., f € A(F) and F € C], as in 


(a,C) 

eee 
(0, D) 

oe Tee 

(fF) (a’,C’) (3) 

ons Be 

(e, E) 
> 
(a”,C"). 


Thus, & and @ are arrows in f{, A and vk = wé; or in other words, 
k: F — D and é: F — E are arrows in C such that k- f = 6 and 
£- f =e, and vk = wt: F > C’ in C. But then (r, a) and (r’,a”) are 


related as in the right-hand side of (2), namely, by C ee F Ba C” and 
f € A(f). This shows that the right-hand side of (2) indeed defines 
an equivalence relation, and hence that the quotient R ®c A of the set 
R xc, A can indeed be described as asserted in (2). 

Proof of Theorem 3: (<=) Suppose the given functor A: C > 
Sets is flat; ie., that — @c A: Sets©” — Sets is left exact. Then, 
first of all, the functor — ®c¢ A must preserve the terminal object, ie., 
1@c A isa one-point set. So in particular A must be nonempty, and the 
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condition (i) below Definition 2 holds. For condition (ii), the fact that 
— @c A preserves products shows, for any objects C' and D from C, that 
the canonical map (y(C) x y(D)) @c A > (y(C) @c A) x (y(D) @c A) 
is an isomorphism. The isomorphism (4) of §5 for both objects C’ and 
D then shows that the map 


(yC x yD) @c A> A(C) x A(D), 


u v (4) 

(B-+C,B—- D)®@cr (u-c,v-c) 
is an isomorphism. Therefore, any pair (a,b) € A(C) x A(D) must 
be in the image of the map (4); this is precisely condition (ii) below 
Definition 2. Finally, to show that condition (iii) holds, consider u, 
vu: C > D and a € A(C) such that ua = va. Let 


y(u) 
P»>—— yC ——2 yD 

y(v) 
be the equalizer in Sets©’; so P is the presheaf on C given for each 
object B by P(B) = {w: B>C | uw = vw}. The left exact functor 
—~ 8c A now transforms this equalizer into an equalizer diagram of sets, 
which by the isomorphism (4) of §5, can be written as 

A(u) 

P®c A ane A(D), (5) 
where for w € P(B) and 6 € A(B) one has i(w @b) = w-b € A(C). But 
A(u)(a) = A(v)(a) by hypothesis, so since (5) is an equalizer of sets, 
there must be some such w and 8 for which w- b= a. This shows that 
condition (iii) holds, completing the proof that A is filtering. 

(=) Assume that the functor A: C — Sets is filtering, so that condi- 
tions (i)—(iii) below Definition 2 hold. Then the set 1 @c A is nonempty 
by condition (i); and condition (ii) shows by the definition (2) of equiv- 
alence that any two elements of 1@®c A are equivalent. Thus, 1@c A is 
a one-point set, so the functor — ®c A preserves the terminal object. 

It remains to be shown that — ®c A preserves pullbacks. Let 


RxpQ—"—>Q 


‘ | [2 


R——— P 


wo 


be a pullback of presheaves on C, while C is an object of C. Since 
pullbacks of presheaves are pointwise, 


(RxpQ)(C) = {(r,4) |r € R(C), g € Q(C), and ¥(r) = (4) € P(C) }. 
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Consider the map 
a: (R Xp Q)®cA- (R®@c A) X(Pe@_ A) (Q Bc A) 
given, for C € C, r € R(C), ¢ € Q(C), and a € A(C), by 
a((r,q)@a) = (r@a,q@a). 
It is clear that a is well-defined [i.e., respects the typical tensor product 
identities as in §2(14). We have to show that a is an isomorphism. 
First a is surjective: Consider (r @a,q®a’) in the codomain of a, 


with g € Q(C), r € R(D), a € A(D), anda’ € A(C). Since the codomain 
of a is a pullback of sets, we have w(r) @a = ¢(g) @a’ in P@c A. But 


A is filtering, so by (2), there are C —— B — D and some b € A(B) 
such that ¢(q)-u = ¥(r)-v in P(B) and u-b =a’, v:b =a. But then 
o(g: u) = o(r-v) by naturality of ¢ and ¥, so (rv, qu) € (R xp Q)(B), 
and 
a((rv, qu) @b) = (rv @b, qu@b) 
= (r @ vb, ¢@ ub) 
= (r@a,q@a’). 
Thus, the given element (r ®a,q@a’) is in the image of a, so a is 
surjective. 
Second, a@ is injective: Suppose that 
a((ri, ) ® a1) — a((re, q2) & a2), 
where r, € R(C,), g, € Q(Ci), and a, € A(C,), for i=1, 2. Thus 
T) @ a) = 12 @ a2, gi © a1 = qo @ ag. 
By the definition (2) of the equivalence, there are C) aay ; eee C> in 
C and b € A(B), and C; —- D —» Cy in C and d € A(D), such that 
git = g2ua, Ub = a, 
T1U) = Teva, uid = aj. 


Consider now the diagram in the category {(, A consisting of the solid 
arrows in (6) below: 


(d,D) 


oe v2 
V1 
NS 
7 
v4 
’ 
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Since A is filtering, it follows by Lemma 3 that there exists a cone on this 
diagram in {(, A, say with vertex (c, K), where K is an object of C and 
ec € A(K), as indicated by the dotted arrows in (6). Thus w: K — D 
and s: kK — B are arrows in C such that 


UjwW = Wis, we = d, sc=b (i = 1,2). 
But then a direct calculation with the tensor identities §2(14) yields 
(71,91) ® @1 = (r2, go) @ a2. 


This shows that a is injective, and completes the proof of Theorem 3. 


Corollary 4. Let C be asmall category with finite limits. A functor 
A: C — Sets is flat iff A is left exact. 


Proof: (=) The Yoneda embedding y: C— Sets©” preserves all 
those limits which exist in C. If A is flat, then by definition 
~ @cA: Sets©” — Sets preserves finite limits. But then so does 
the composite (~ @c A) oy: C — Sets. The latter functor is naturally 
isomorphic to A [cf. (4) and (6) of §5], so A itself preserves finite limits 
as well. 

(<=) It is easy to verify directly that a left exact functor A: C > 
Sets must satisfy conditions (i)-(iii) immediately below Definition 2. 
Indeed, the condition (i) holds because A(1) = 1; (ii) holds because 
A(C) x A(D) = A(C x D) for any two objects C' and D in C, so we can 

Ww 


take C <~ C x D —> D as our arrows C — B — D in (ii). Finally, 
(iii) holds because A preserves equalizers [take B in condition (iii) to be 
the equalizer of u and v]. 


Corollary 5. Let D be a small category. Then the colimit functor 
lim: Sets? — Sets commutes with finite limits iff D°? is filtering. 


(The “if” assertion is familiar; e.g., [CWM, Theorem IX.2.1].) 

Proof: Let C = D°? and let A: C — Sets be the constant functor 
1. Then for any P € Sets” = Sets”, condition (2) and the definition 
of colimits show that P@c A = lim P. So by Theorem 3, the functor 
lim: Sets? — Sets is left exact iff Jc Ais filtering. But clearly {oAis 
(equivalent to) C itself when A = 1. 


7. Morphisms into Grothendieck Topoi 

In this section we will study geometric morphisms from a topos E 
into a Grothendieck topos Sh(C, J) of sheaves on some site (C, J). So 
compared with the Section 5, we have replaced Sets by a topos €. The 
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analysis will be parallel to that in a prior section. Again we will begin 
with the case of a presheaf topos Sets“, and establish a correspon- 
dence between geometric morphisms € — Sets©” and flat functors 
C — €, analogous to Theorem 5.2. Next, as in Corollary 5.4., this corre- 
spondence restricts to one between geometric morphisms € — Sh(C, J) 
and continuous flat functors C > €. In a subsequent section we will 
also give a more explicit description of flatness, similar to Theorem 6.3. 

Suppose f: € > Sets” isa geometric morphism. Its inverse image 
functor f*: Sets©” = E€ preserves colimits, and every presheaf R on C 
is a colimit of representable presheaves, so (up to natural isomorphism) 
the functor f* is completely determined by what it does to representa- 
bles. More explicitly, each presheaf R € Sets©” is canonically a colimit 
lim H of representables. Indeed, R determines the category { R of its 
elements, with objects the pairs (C,r), where r € R(C) and C € C, 
while H: [R= Sets©” sends this object (C,r) to Hom(—,C) =yC. 
Exactly as in §2(10) this colimit lim H can be presented as a coequalizer, 
with coproducts over the arrows u: (C’,r’) — (C,r) and then over the 
objects of f{ R. This gives the following coequalizer in Sets©’”: 


I yoo I] vor 
u: OC Cec (1) 
reR(C) réR(C) 
specifically, if we write elements of [],,y(C’) as triples (7, u, v) with 
r € R(C), u: C’ > C and v € y(C’)(B) = C(B,C”) for some object 
B in C, and similarly write elements of [[~,. y(C) as pairs (r, w) with 
w € C(B,C), then 6 and 7 in (1) are the maps given by 


A(r, u,v) = (r, vor), r(r, u,v) = (ru, v). 


The functor f*: Sets©” — E preserves colimits, so it sends this co- 
equalizer (1) to the following coequalizer in the topos €: 


I] fyeo=—= I] frye re. ; 
uw: C’3C Cec ( ) 
r€R(C) reR(C) 

Now suppose that € has all small colimits, let A: C —3= € 
be any (covariant) functor and consider the tensor product functor 
~ @c A: Sets” = €. Explicitly, for any R € Sets© . the object 
Rc A of E is defined by the following coequalizer: 
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the maps @ and 7 are as described in §2(11). 

The functor ~ ®c A: Sets©” — € thus defined is an extension 
of the functor A: C — €, in the sense that, up to isomorphism, the 
diagram 


C—4— 6 


| ons A (4) 


Sets” 


commutes. Indeed, this is just the canonical isomorphism, like §2(20), 


y(C) @c A= Hom(— ,C) @c A= A(C). (5) 
Moreover, the functor — ®c A is left adjoint to the functor 


Hom, (A, —): € > Sets©” (6) 
defined, as in §2(4), for an object & in € and an object C in C, by 
Hom,(A, E£)(C) = Hom¢(A(C), £). (7) 


This adjunction (Theorem 2.1 [bis]) assumes that € is cocomplete. The 
cocompleteness hypothesis “€ has all small colimits” may be replaced by 
“€ is a topos over sets”, provided the functor A: C — € is also replaced 
by a left A(C) object (an “internal diagram”) on the category object 
A(C) in €. See V.7(5). Here A: Sets — E is the canonical functor, 
§1(8). 


Definition 1. A functor A: C — € is said to be flat if the corre- 
sponding tensor product functor ~ @c A: Sets©” — € is left exact. 


This definition, like that for modules, is analogous to Definition 5.1. 
A flat functor A: C > € induces a geometric morphism 


1(A): € > Sets®” 


with inverse image functor 7(A)* = — ®cA and direct image func- 
tor 7(A), = Hom,(A, —); indeed, 7(A)* is left exact since A is 
flat, and left adjoint to 7(A)« by Theorem 2.1 [bis]. Furthermore, a 
natural transformation 4: A — A’ induces a natural transformation 
T(A): ~ ®cA— — @c A’ in the evident way, so one obtains a functor 


r: Flat(C,€) > Hom(E, Sets©” ) (8) 
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from the category of flat functors C — € to the category of geometric 
morphisms € > Sets©”. 

In the other direction, suppose we are given a geometric mor- 
phism f: E — Sets©””. Composition of the inverse image functor 
Pie Sets©” — € with the Yoneda embedding y: C > Sets©’” yields a 
functor 


P(f)=froy: CHE. 


A comparison of the coequalizers (2) and (3) for a presheaf R € SetsC” 
immediately yields an isomorphism R @c p(f) = R@c(f* oy) = f*(R), 
natural in R. This means that rp(f) & f; in particular, p(f) is flat. 
Moreover, the fact that (4) commutes up to isomorphism for any given 
A: C — Sets implies that pr(A) = A. We thus obtain the following 
analogue of Theorem 5.2. 


Theorem 2. Let € be a topos with small colimits, and let C be any 
small category. Geometric morphisms E > Sets” correspond to flat 
functors C — €, by an equivalence of categories 


Hom(E, Sets~”’ ) <> Flat(C, €), 


as described above. 


Now let J be a Grothendieck topology on C, with the associated 
topos of sheaves Sh(C, J). In order to derive a similar correspondence 
for geometric morphisms € — Sh(C, J), we first state the following 
analogue of Lemma 5.3. 


Lemma 3. Let f: E > Sets©” be a geometric morphism, The 
following are equivalent: 


(i) f factors through the embedding 7: Sh(C, J) — Sets©”; 
(ii) f* oy maps covering sieves in C to colimits in E; 
(iii) f* oy maps covering sieves in C to epimorphic families in E, 


Proof: The proof for the case where € = Sets (Lemma 5.3) applies 
almost literally to the case of an arbitrary topos E. 


In analogy with the case of functors A — Sets discussed in Section 5, 
a flat functor A: C — € into a topos € is said to be continuous for the 
topology J if A sends covering sieves to epimorphic families in €, or 
equivalently (by the lemma), to colimits in €. Let us write 


ConFlat((C, J), €) 


for the full subcategory of Flat(C, €) consisting of the continuous flat 
functors. By Lemma 3 above, the equivalence of Theorem 2 restricts to 
an equivalence of categories as in the following corollary. 
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Corollary 4. Let (C, J) be a site, and let E be a topos with small 
colimits. Geometric morphisms € — Sh(C, J) correspond to continuous 
flat functors C — €, by an equivalence of categories restricting that of 
Theorem 2, 


We thus have a diagram of categories and functors, commutative up 
to natural isomorphisms, 


Hom(€, Sh(C, J)) <——, ConFlat((C, J), €) 


| |e 


Hom(€, Sets©’”) ——s Flat(C, €) 


in which the rows constitute equivalences of categories. 


8. Filtering Functors into a Topos 


Our next aim is to describe flat functors from a small category C 
into a topos € in more elementary terms, analogous to Theorem 6.3 
which dealt with the case € = Sets. To begin with, we shall have to 
adjust the notion of a filtering functor so that it applies to a functor 
with values in any topos €. The definition of a “filtered” category I in 
§6 required at (ii) that any two objects i and 7 of I be “joined” by a 
pair of arrows from some object k of I. Now for a functor A: C > € 
we require instead that any two objects C and D of C be “joined” by 
a family of pairs of arrows which becomes in € an epimorphic family. 
Similarly, condition (iii) in §6 required that any two parallel arrows in I 
be equalized by some arrow. Now we require that the equalizing arrows 
yield in € a suitable epimorphic family. Here is the formal definition. 


Definition 1. A functor A: C — € from a small category C into a 
topos E is said to be filtering when the following three conditions hold: 


(i) The family of all maps A(C) — 1, for all C € C, is epimorphic. 
(ii) For any two objects C, D in C, consider all objects B and all 
arrows C’ —B—D in C. Then the resulting maps 


(A(u), A(v)): A(B) > A(C) x A(D) (1) 


constitute an epimorphic family into A(C) x A(D). 

(iii) For any two parallel arrows u, v: C — D in C let Eu» be the 
equalizer in E of A(u) and A(v). Consider all objects B of C 
and all arrows w: B > C with uw = vw. Then the arrows A(w) 
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factor through E,,, to give an epimorphic family of maps (here 
dotted) into Ey»: 


A(w) A(u) 
A(B) —— A(C) aa A(D) 


, 
\ 
| 
\ 
, 

v 


Buy. 


These three conditions are simply diagrammatic versions of the con- 
ditions (i)—(iii) stated below Definition 6.2. For example, condition (iii) 
there states that any element a € A(C’) which is in the set-equalizer E,, , 
lies in the image of some A(w). 

To manage these epimorphic families, we will need: 


Lemma 2. For any categories D and C, any functor L: D > C 
which has a right adjoint R takes epimorphic families in ‘D to epimorphic 
families in C. 


Proof: Suppose that {e;: U; — B} is an epimorphic family in D 
while s, £: LB — C are maps in C such that sL(e;) = tL(e;) for all 7. 
Then by taking the transposed maps 3 and # along the adjunction we 
have 

Se; = te;: U; - B= RC 


for all 7, by the naturality of the adjunction. Since the e; form an 
epimorphic family, this gives § = f and hence s = ¢, as required. 

In particular, this means that pullback along an arrow k: B — B’ 
in a topos € preserves epimorphic families { e;: U; — B’}. Indeed, the 
pullback k*: E/B' — €/B has a right adjoint (§IV.7), hence sends {e;} 
to an epimorphic family { k*(e;) = e;x 1: U;xg  B- B’ xp B= Bh in 
€/B. Since €/B — E also has a right adjoint, this family is epimorphic 
in € as well. 

Our purpose is to prove eventually that a functor A: C > € froma 
small category C into a cocomplete topos € is flat iff it is filtering. But 
we will only achieve this towards the end of the next section. First, it 
will require several subtle but equivalent descriptions of filtering functors 
A: C— €. The first one, as stated in the following lemma, is essentially 
a translation of Definition 1 above into the language of “generalized 
elements”. 


Lemma 3. A functor A: C — € is filtering iff it satisfies the fol- 
lowing three conditions: 


(i!) For any object U € € (that is, for any generalized element 
U — 1 of the terminal object 1) exists some enimorphic family 
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{e;: U; — U} in € and for each index i an object B; of C and a 
generalized element U; — A(B;) in E. 

(ii’) For any two objects C and D in C and any generalized element 
(c,d): U = A(C) x A(D) in E, there is an epimorphic family 
{e;: U; ~ U} in E and for each index i an object B; of C with 
arrows u;: B; — C' and v;: B; — D in C and a generalized 
element b;: U; — A(B;) in €, all such that the following diagram 
commutes: 


U; ———— U 


| | cd) (3) 


ABs) Tapayateay AO) x AP). 

For any two parallel arrows u, v: C — D inC and any generalized 
element c: U > A(C) in € for which A(u)(c) = A(v)(c), there 
is an epimorphic family { e;: U; > U } in € and for each index i 
an arrow w;: B; — C' and a generalized element b;: U; — A(B;) 
such that the following diagrams commute in €, respectively in 


(iii 


—, 


C: 
U; ae ee ff B; 
| PS Je (4) 
A(u) ‘i 
A(Bi) "aan A(C) rik A(D), Ca: 


[In fact, (i') is equivalent to (i), (ii’) to (ii), and (iii’) to (iii).] 
Proof: (i)=(i’) For U € €, construct for each object C' € C the 


pullback 
Ug ———> A(C) 


_ ° 


U ———— 1. 
By assumption, the family of maps on the right (indexed by C’) is an 
epimorphic family, which by pullback gives the epimorphic family of 
arrows Uc — U on the left while each Uc — A(C) is a generalized 
element. 
(i')=>(i) In (i'), take U = 1. The resulting epimorphic family {e;} is 
then expressed in terms of the generalized elements of A(B;) as 


(e:: Uj 3 1) = (U; — A(Bi) > 1). 


Hence the right-hand family of arrows A(B;) — 1 is epimorphic, and (i) 
follows. 
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(ii)=>(ii’) The epimorphic family (1) given in (ii) pulls back along 
the given generalized element (c,d) to give an epimorphic family U; > 
A(B;) with index 7 running over all diagrams C — B — D in C (where 
C and D are fixed), such that the squares of the form (3) commute. 

(ii’)= (ii) Apply (ii’) with U = A(C) x A(D) and (c,d) the iden- 
tity. Since the arrows e; in (3) form an epimorphic family, so do the 
bottom arrows in (3). Thus the family (1) which they form must also 
be epimorphic. 

The similar proof of the equivalence (iii)<(iii’) is left to the reader. 

In Section 6, we have shown that for a filtering functor A: C — Sets, 
there is a cone on any finite diagram in the category {(, A of elements 
of A. There is an analogous fact for filtering functors A: C —- € into a 
topos €, but its statement is more involved and requires the construction 
of categories of generalized elements. Consider any functor A: C — €, 
and let U be an object of €. By composition with Homg(U, ~): E > 
Sets, we obtain a functor 


Hom, (U, A): C — Sets, Hom, (U, A)(C) = Home (U, A(C)). 


We can thus apply the Grothendieck construction of §6 to obtain the 
category of elements of this functor into sets—that is, the category of 
generalized elements defined over U of the functor A, denoted by AU: 


ef Hom, (U, A). (6) 
Cc 


In other words, the objects of AY are pairs (C,c: U > A(C)), with C in 
C and c an arrow in €; a morphism (C,c) > (C’,c’) in AU is an arrow 
u: C > C’ in C with the property that c’ = A(u) oc: U > A(C’). 

If e: U’ = U is an arrow in E, there is a functor 


e#: AU AU’ (7) 


defined in the obvious way by composition with e. Explicitly, for an 
object (C,c: U > A(C)) of AY, set 


e#(C,c) = (C, coe). (8) 


In particular, each diagram D in the category AY yields, by applying 
the functor e# to D, a diagram e#(D) in AU’. 


Lemma 4. A functor A: C — € is filtering iff for each object U of 
E and each finite diagram D in AY, there exists an epimorphic family 
{ e;: U; > U} such that, for each index i, there is a cone on the induced 
diagram e;#(D) in AU. 
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Proof: The proof is analogous to that of Lemma 6.1. 

(<=) Assume that A satisfies the condition on finite diagrams as 
stated in the lemma. We shall prove that conditions (i’)-(iii’) of 
Lemma 3 hold. First, consider for a given object U of € the empty 
diagram D in AY. For a map e;: Uj; > U, e;*(D) is the empty diagram 
in AUY:, and a cone on this diagram is simply an object of Au: ie, a 
pair (C; € C,c;: Uj; — A(C;)). So clearly condition (i’) holds. 

To prove (ii’), take an object U of €, and arrows c: U > A(C) 
and d: U — A(D) in €. Consider the diagram D in AY consisting of 
just two objects, viz., (C,c) and (D,d). Then, by hypothesis, there is an 
epimorphic family { e;: U; > U } such that for each index i there exists a 
cone on the diagram e# (D) in AY, Now e# (D) is the diagram consisting 
of just two objects (C, ce;) and (D, de;), and a cone on it in A™ is given 
by arrows u;: B; — C and v;: B; — D in C, together with an arrow 
b:: U; — A(B;) in € such that A(u;)(b:) = ce; and A(v,;)(b;) = des, 
exactly as required in condition (ii’) of Lemma 3. 

Finally, condition (iii’) follows by a similar argument with diagrams 
in AY of the form ese. 

(=) Now assume that the functor A: C — € is filtering, so that 
conditions (i’)—(iii’) of Lemma 3 hold. The required cone will be then 
constructed much as in Lemma 6.1, by induction on the number of ar- 
rows in the given diagram D in AY. We will consider only the step 
in which a new arrow t: (C,c) — (D,d) is to be added to D—where 
D already contains the objects (C,c) and (D,d). By the induction as- 
sumption, there is already an epimorphic family e;: U; — U in € and 
for each index i a cone on e;*(D) with a vertex (B;, b;), as in the figure 
below in the category AU: , 


(B;, b:) 


| Se, (9) 


(C, ces) —=— (D, de;) ——— (D', d’e;) ——— (D",d"e;)... 


where all the triangles commute, except the first one where perhaps 
tu # uv: B; > D. Since both components were arrows in A¥:, one does 
have A(tu)b; = de; = A(v)(b;). Condition (iii’) of Lemma 3 therefore 
gives, for each 7, an epimorphic family 


{ fig: Vig Ui; (10) 


in €, and for each index j an arrow w;;: Li; > B; in C and a generalized 
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element €,;;: Viz — A(,;) such that the diagram 


Vay, 
“| |» 
A(L By) 24 ap 
a aga aa 


commutes. Therefore we find, for each i and j, a cone on the diagram 
fies#(D) in A”, constructed by applying fi;* to (9) and then putting 
wig? (Liz, &ij) - (Bi, bi fiz) on top of it. But for each i, the family 
{ Vij — Ui }3 of (10) is epimorphic, as is the family {U; — U},. There- 
fore, so is the family of composites { V;; > U; - U },;. This completes 
the induction. 
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The main result of this chapter (Corollary 9.2 below) states for a 
topos with small colimits € and a site (C, J) that geometric morphisms 
E€ — Sh(C, J) correspond to functors A: C — € which are continuous 
and filtering. [In the language of the next chapter, one says that Sh(C, J) 
“classifies” continuous filtering functors on C.] This result will be an 
immediate consequence of Corollary 7.4 and the following result which 
generalizes Theorem 6.3 from Sets to a topos: 


Theorem 1. Let € be a topos with small colimits, and let C be a 
small category. Then a functor A: C — € is flat iff it is filtering. 


The proof of this result is somewhat long and technical, and the 
reader may prefer to postpone its proof and study some of the applica- 
tions (e.g., in §§VIIL.3-6) first. 

Proof: (=) Suppose that the given functor A: C — € is flat, and 
so has the property that — ®c A: Sets°? — E€ preserves finite limits. 
We have to check that conditions (i)—(iii) of Definition 8.1 hold. Recall 
that for a presheaf R € Sets©”, the object R @c A of E is defined as 
the following coequalizer (1) in € [identical to §7(3)| 


In particular, since — ®c A preserves limits, we have 1@c A & 1, so 
Lcec A(C) - 1 is epi; ie., condition (i) of Definition 8.1 holds. To 
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check condition (ii), “join two objects”, take two objects C and D of C, 
and consider the diagram 


Ll] 42)= [] 48 —*— wee) x y(D)) @c A 


BoC BOC 
v v 
B--D B-+D 
t 
B’—+B aye (2) 


A(C) x A(D) <= (yC ®c A) x (yD ®c A). 


Here the upper row is the coequalizer defining (yC x yD) ®@c A, v is 
given on each summand A(B) as in (ii) of Definition 8.1, and a@ is the 
canonical isomorphism which states that ~ ®c A preserves the product 
yC x yD; finally, the two y’s on the bottom are canonical isomorphisms 
of the type §7(5). By spelling out the definitions, one readily checks that 
the square on the right commutes. But ¢ is a coequalizer, hence epi, so 
v must be epi as well. This shows that (ii) of Definition 8.1 holds. 

Finally, one can verify in a similar way that condition (iii), “equalize 
two arrows”, follows from the fact that for any given arrows u, v: C — D 
in C, the tensor product — ®c A must preserve the equalizer 


yu 
P-—— yC rs yD 


in Sets©”, where the presheaf P is defined for each object B by P(B) = 

{w|w: BC in C and uw = vw}. We leave the details to the reader. 
(<=) Let us now turn to the converse. Suppose that the given functor 

A: C — € satisfies (i)—(iii) of Definition 8.1, or equivalently (i')—(iii’) 

of Lemma 8.3 for a filtering functor. We will prove that the functor 

~ @c A preserves the terminal object as well as all pullbacks. From 

this it follows that — @c A preserves all finite limits ((CW™M, p. 109}). 
In order to show that 1®c A = 1, consider the diagram: 


I] 4¢) = [][4@ * 18c4 
cc a 


(3) 


LT] 48) [[4©@ x 4) $3 [J 4c) — 1. 
C,D Cc 


CBD 
Here the top row is the coequalizer defining 1®c A [so 6 sends the u 
summand A(C’) to A(C) via A(u): A(C’) > A(C) and 7 sends it to 


9. Geometric Morphisms as Filtering Functors 401 


A(C") via the identity]. By assumption on A, the map [[, A(C) > 1 
is epi. Hence, by Theorem IV.7.8, it is a coequalizer of its kernel pair; 
i.e., of the two projections from ([[, A(C)) x ([[¢ A(C)). Distributing 
both these coproducts over the product, this means that the right-hand 
part of the bottom row in (3) is a coequalizer. Moreover, by assumption 
on A, as in (1) of Definition 8.1, the evident map v is an epimorphism. 
Thus, for any object FE of the topos € and any a: [[, A(C) > Fin €, 
with components ac: A(C) > E, say, we have 


ad=ar << forallu: C’ —C, aco A(u) =ac 


<=> forallC — Bs D, acA(u) = ag = apA(v) 
—_= QM |Y = ANgv 


<> am, =an2 (since v is epi). 


It follows that (7,@) and (71, 72) have isomorphic coequalizers; and hence 
that 1®@cA=1. 

Next, we show that — @c A: Sets©” 3 € preserves pullbacks. The 
proof will use a suggestive notation for maps into tensor products. If 
R€ Sets©”” is any presheaf, with associated tensor product as in (1) 
above, then for any Co € C and any rp € R(Co), we write 


79 ® —: A(Co) > R@cA (4) 


for the composition of the coequalizer ¢: [] A(C) — R@c A of (1) and 
the coproduct inclusion A(Co) — [] A(C) to the summand indexed by ro 
and Co. Then if u: C; — Cp is any map in C, the usual tensor identity 


rou®@ — = ro @u-— holds; or more precisely, rpu®@ — = (ro@ —) 0 
A(u): 
A(C\) _ A(Co) 
rou® — ro ® — (4’) 
Rc A. 


Notice that the family of all such maps rp ® — [for all Co © C and 
all ro € R(Co)| is an epimorphic family into R @c A—indeed, their 
coproduct is the epimorphism ¢ in the coequalizer definition (1) of ®c. 

We shall first consider the special case of a pullback between rep- 
resentable presheaves. (The general case follows easily from this since 
every presheaf is a colimit of representables; cf. Lemma C below.) So 
consider in Sets©” such a pullback R of three representable presheaves, 
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as in the diagram 


f 


Ra ye 


i : 


yD —_,— yB 


where (by Yoneda) u and v are given by maps (still called) u: C — B 
and v: D — Bin C. Then for each object E in C the pullback set R(E) 
is described as 


R(E)={w: E3C,v': E> D| uw =0'} (6) 


and 

fe(u,v)=u, ga(u',v')=0'. (7) 
Now apply the functor — ®c A to the pullback (5). Now yC@cA & 
A(C) and similarly for B and D, so this gives a pullback U in € with 
arrows c, d and a commutative diagram of the form 


R®cA 


\ Ke f 
“SS 


A(D) —3--—> A(B). 


Here o is the canonical map from the value of the functor — ®c A on 
the pullback R to the pullback U of the values, modulo the canonical 
isomorphisms y(B) ®c A = A(B) (and similarly for C’,, D) as in §7(5). 


More explicitly, f: R@c A— A(C) is the evident composition 
F: R®c AL24 y(C) @c AX AC) 


of f @ A with the isomorphism §7(5). In terms of the maps in (4), f can 
alternatively be described as the unique map such that for each E € Cc 
and each r = (u’, v’) € R(E) as in (6) one has f(r ® —) = A(u’)« = uh, 
as in 


A(E) (uw) ® - PEGA 
A(u’)=u', 7 (9) 


A(C). 
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The corresponding description applies to @. 

To say that — @c A preserves the pullback (5) means that o in (8) 
is an isomorphism. We now show, in two separate lemmas, first that o 
is epi and then that it is mono. 


Lemma A. The map o of (8) is an epimorphism. 


Proof: The diagram (8) for each index u, v, with v: D > B, yields 
the following diagram D in the category AY of generalized elements of 
A 


2 


(D,d: U > A(D)) (C,c: U + A(C)) 


eNO oo 


(B,v,d = usc: U > A(B)). 


By Lemma 8.4, there is an epimorphic family {e;: U; > U} in € anda 
corresponding family of generalized elements (B;,b;: U; — A(B;)), such 
that each (B;,6;) is the vertex of a cone on the diagram e;#(D) in AU, 
with arrows v;: B; — D and u;: B; — C as in 


(B;, 5;) 


aaa 


(D, de;) (C, ce;) (11) 


(B, v, de; => U4 Ce; ). 


Thus, wu; = vv;: B, > B in C, and 


A(u;)(b;) = Usa (dg) = ce, Vix (dy) = de;. (12) 
In the topos €, we thus obtain for each index 7 a diagram 
A(B;) ——— U; 
ne] |: (13) 
R&c A—z— U,7 


for o as in (8), which is easily seen to commute. Indeed, U is a pullback 
(8), and 


cog 0((u4, 1) @ —) 0b; = fo((ui, %) ® —) ob; [by (8)] 
= (Ui )«bi [by (9)] 
=coeg [by (12)), 
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and in the same way dogo ((ui,vj;)® —) 0b; = de;. Therefore 
o(( ui, vi) ® — )b; = e;, showing the commutativity of (13). Since the 
e;: U; — U are collectively epi, it follows that o is an epimorphism. 


Lemma B. The map o of (8) is a monomorphism. 


Proof: Consider any two maps ¢, : V > R@c A in € such that 
og =o: V > ROCA U. 


We wish to show that @ = . Take the family of all objects E; with 
arrows uj, v; in C which fit into a commutative square with the given 
u and v of (5) 


»| | UU = VV;. (14) 


Then by the definition (1) of the tensor product and the definition (6) 
of the presheaf R, these together give an epimorphic family of maps in 
E: 

(( uj, 03) @ —): A(Ej) > R@cA (15) 


[these are maps of the form (4)]. Next, construct for each pair of indices 
4, k the diagram 


V; 
yee p.b. ae 
Vi gg ee 18) 
+| p.b. | p.b. |v 
A(E}) ((u,0,) ® —) RCA ae 2) A(Ex); 


where ¢; is the pullback along ¢, yw that along 7, and the upper square 
(distorted) is also a pullback. Since pulling back preserves epimorphic 
families (§7), it follows that {2;: V; > V}; and {Ax: Ve > V }x are 
both epimorphic families, and, for the same reason, so is the family of 
composites { 1j0j;% = AK Bjn: Vik > V }jx- We claim that for any pair 
of indices j, k the two outer composites in (16) are equal: 


(( uj, 05) ® — )bjaye = (( ue, Ve) @ — PRA jE (17) 
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Before we show (17), let us remark that Lemma B follows once (17) is 
proved. Indeed, by commutativity of the two lower squares in (16), (17) 
yields éajajx = WAR G;jk. Hence, since the maps mjajk = Ax Gyx (all Jj, 
k) form an epimorphic family into V, we conclude that ¢ = w. 

To prove (17), consider for each pair of indices j, k the following 
diagram, obtained from (16) and (14): 


A(E;) —-— A(C 


oy Ne 
Fi (18) 
<< 


A( Ex) yz A(D 


The diagram can be interpreted as a diagram D in the category of ele- 
ments AY%* defined over V;,. So by Lemma 8.4 for the filtering functor 
A, there exists an epimorphic family 


{ee: Ve Vik be 


and for each index € a cone on eg#(D) in AVYé, Such a cone is given by 
a vertex 


(Me, me: Ve > A(Me)) 


and arrows from this vertex, 
Wej: (Me, mg) —— (E;, Pj O5keE), 


wer: (Me,me) —— (Ex, Ue Bjree)- 


In particular, since this is a cone, the diagram (19) below in € commutes 
and the identities in (20) hold: 


A( Mg) —"— Ve —"S—+ A(Mg) 
A(we, } |e |Atwa)=we (19) 
A(Ex), 


A(E;) 3% ke Ik by Bik 


UjWej = URWER, Uj Wey = Uk Wer. (20) 
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But then 
(( Uj, 05) @ ~ \bjazjnee = (( uz, 5) @ —)A(wes)me [by (19)] 
= ((ujwe;, vjwe;)® —)me [by (4*)] 
= ((URWER, VEWer) @ — )Me [by (20)] 
= ((Uk, Uk) @ ~)A(wen)me [by (4’)] 
= ({Uk, Ue) @ — ) Pe Ajnee [by (19)] 


Since the arrows e¢ form an epimorphic family, we conclude that (17) 
holds, as was to be shown. This completes the proof of Lemma B. 


We have now shown that — ®q A preserves pullbacks of representa- 
bles [as in (5)}. To complete the proof of Theorem 1, it thus suffices 
to prove the following lemma, since ~ ®c A certainly commutes with 
colimits [it has a right adjoint Hom(A, —~ )]. 


Lemma C. Let é: Sets’” —. € be any functor which preserves 
colimits and pullbacks of maps between representables [i.e., pullbacks of 
the form (5)}]. Then € preserves pullbacks. 


Proof: Consider any pullback diagram of presheaves 
Q xp R——>R 


| | 


Q ————  P. 


Write P as a colimit of representables, say as P = lim Pi, and let the 
presheaves Q;, R; be the pullbacks as in 


Q; —-> PF; ———__ R; 


| | | 


Q +P R. 


Next write each R; and each Q; as a colimit of representables, say 
R; & bs Riz and Q; = lim, x Qiz. Since, in a topos, pulling 


+ 


back commutes with colimits, we have 
lim. ine lim (Qix Xp, Riz) = lim (Qi xp, Ri) (21) 
“QxpR, 


and by the same argument, since € preserves colimits, 


lim, lim lim, (€(Qiy) ec.) §(Rig)) = 6(@) Xecp) (R)- (22) 
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But by assumption, € preserves pullbacks of the form Qix x p, Rij, as well 
as colimits, so the left-hand side of this expression (22) is isomorphic to 
&(lim. lin . lim (Qik x p, Rjz)), and hence by (21) to (Q xp R). Thus, 
&(Q Xp R) = &(Q) Xe(p) €(R), as was to be shown. 

This concludes the proof of Theorem 1. It yields a main result as 
follows. 


Corollary 2. Let € be a topos with small colimits, and let (C, J) 
be a site, There is an equivalence of categories between the category 
Hom(€, Sh(C, J)) of geometric morphisms € — Sh(C, J) and the cate- 
gory of continuous filtering functors C > E€. 


Proof: Immediate from Theorem 1 and Corollary 7.4 which gave 
this result for continuous flat functors. 


Corollary 3. Let C be a small category with finite limits, and let 
E bea topos with small colimits. A functor A: C — € is flat iff A is left 
exact. 


Proof: This follows from Theorem 1 by the same proof as that of 
Corollary 6.4 (of Theorem 6.3). 


Corollary 4, Let (C, J) bea site where C has finite limits, and let 
E be a topos with small colimits. There is an equivalence of categories 
between geometric morphisms E — Sh(C, J) and continuous left exact 
functors C — €. 
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In the previous sections, we have considered morphisms from a topos 
E into a Grothendieck topos Sh(C, J) of sheaves on a site (C, J): they 
were described in terms of continuous filtering functors C > €. If € is 
itself a Grothendieck topos, say € = Sh(D, K) for a site (D, K), then 
it suffices to consider objects U, U; from D in using the conditions (i’)- 
(iii’) of Lemma 8.3 for filtering functors. In this way, one obtains the 
following description of such geometric morphisms. (Observe that the 
two objects C, D of C in §8 here are written as C, C’.) 


Theorem 1. Let (D, K) beasite. There is an equivalence between 
geometric morphisms Sh(D,K) — Sh(C, J) , and functors A: C > 
Sh(D, K) which have the following four properties (the first three express 
the fact that A is filtering, the last one that A is continuous): 


(i) For any object D of D, D is covered by the set of arrows described 
as{g: D'— D| for some C € C, A(C)(D’) ¥ 9}. 

(ii) Consider objects C, C’ of C and an object D of D. For each 
pair of elements a € A(C)(D) and a’ € A(C’)(D) the object D is 
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covered by the set of those arrows g: D' = D which “join” a to 
a’ in the sense that there are u, u’, and b with 


Ca pao and = bE A(B)(D’) 
such that 
A(ujb=a-g and A(u')(b) =a’ -g. 
Consider arrows u, v: C — C’ in C and an object D of D. For 
each element a € A(C)(D) with A(u)a = A(v)a, the object D is 


covered by the set of those arrows g: D' — D for which there are 
w and b with 


(iii 


— 


w:BoCinC and b€ A(B)(D’) 
such that 
uw=vw and A(w)(b) = ag. 


(iv) Given a cover S € J(C), an object D € D and an element a € 
A(C)(D), the object D is covered by the set of those arrows 
g: D' = D for which there are u and b with 


u:C’>CinS and be A(C’)D' 


such that 
A(u)b = ag. 


In condition (iii), observe that the last displayed equations insure 
that A(u)ag = A(v)ag. 

Proof: Again consider epimorphic families. Note that a family 
{e;: U; — U} of morphisms of sheaves U;, U on (D,K) is an epi- 
morphic family (i.e., the induced map [[U, > U is an epimorphism of 
sheaves) iff, for every object D in D and every a € U(D), the object D 
is covered by the set of those arrows g: D’ — D for which there are an 
index i and 6 with b € U,(D’) such that 


e;(b) = ag. 


Indeed, this follows from Corollary III.7.6 in exactly the same way as 
Corollary III.7.7 there does. In particular, condition (iv) of the theorem 
now says exactly that for a covering sieve S of an object C in C, the 
family { A(u): A(C’) + A(C) | u € S} of morphisms of sheaves on 
(D, K) is an epimorphic family. In other words, A satisfies (iv) iff A 
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is continuous. In a similar fashion, using the description of epimorphic 
families of sheaves just given, conditions (i)—(iii) of the theorem corre- 
spond exactly to the epimorphic family conditions (i)—(iii), respectively, 
of Definition 8.1 of a filtering functor to a topos € for the special case 
where € = Sh(D, Kk). 

We shall now consider several types of functors between sites, and 
the geometric morphisms these functors induce. 

Suppose (C, J) and (D, /) are two sites, and suppose C and D are 
both closed under finite limits (given a Grothendieck topos, such a site 
always exists for this topos, see Giraud’s theorem in the Appendix). 
Grothendieck & Co. call a functor 6: C — D a morphism of sites if 
preserves finite limits as well as covers. Here ¢ preserves covers means 
that if S € J(C’) is a covering sieve of C' in C, then the sieve (¢(S)) 
generated by the set { ¢(u) | u: C’ > C is in S} is a covering sieve of 
g(C) in D. 


Theorem 2. For categories C and D with finite limits, any 
such morphism of sites ¢: C — D induces a geometric morphism 
f: Sh(D, K) — Sh(C, J); the direct image functor f,: Sh(D,K) > 
Sh(C, J) sends a sheaf F on D to the composition f,(F') = Fo ¢, and 
the inverse image f*: Sh(C, J) — Sh(D, K) sends a sheaf G on C to 
the tensor product G @c Ag, where 


Ag =aoyog: c-*. D_—_* > Sets?” —*+— + Sh(D, Kk). 


Proof: The Yoneda embedding y: D > Sets?” and the associated 
sheaf functor a are both left exact while ¢ is assumed to be left exact. 
Therefore, the composite functor A,: C — Sh(D, K) defined above is 
left exact. By Corollary 9.3, Ag is flat. Moreover, by assumption, ¢ 
sends covers in C to covers in D, while aoy: D > Sh(D, K) then sends 
these to epimorphic families (cf. Corollary III.7.7.). Hence, the functor 
Ag: C > Sh(D, K) is also continuous. By Corollary 7.4, it thus induces 
a geometric morphism f = 7(Ay) with f* = ~ @c Ag, as asserted in 
the theorem, and with f, = Homgy(p x)(Ag, —). Thus, given a sheaf 
F on the site (D, K) and an object D in D one has 


f.(F)(D) = Homgyp,x)(Ag(P), F) 
= Homs,p x) (ay¢(D), F) = (Fo ¢)(D), 


the latter because the sheafification a is left adjoint to the inclusion 
Sh(D, K) Sets?” and by the Yoneda lemma. Thus, up to a natural 
isomorphism, f, = F o ¢, as stated in the theorem. 

The example of a geometric morphism f: Sh(X) > Sh(Y) induced 
by a continuous map of topological spaces f: X — Y [Theorem II.9.2 
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and §1(1)] can be seen as an instance of this Theorem 2. Indeed, the 
inverse image map f—!: O(Y) — O(X) preserves finite meets, which are 
the finite limits in the lattice O(Y) [resp. O(X)], and it preserves covers 
because these are just unions of open sets. We will encounter several 
other applications of Theorem 2 in the following chapters. 

If the sites (C, J) and (D, K) do not have finite limits, it can be 
quite cumbersome to check for a given functor 6: C — D whether the 
composite Ay = ay¢: C — Sh(D, K) is flat. In principle, one should 
verify the conditions of Theorem 1, but in some cases one can simply 
apply the following lemma. Its statement requires the following defini- 
tions: A functor 7: D — C is said to have the covering lifting property 
(clp) if for any object D of D and any J-cover S of x(D), there exists a 
K-cover R of D such that 1(R) = {z(u) |u € R} C S. In other words, 
every cover of the image of an object D in D is refined by the image of a 
cover of D itself. (One also says that the functor 7 is “cocontinuous” .) 


Lemma 3. Let (C, J) and (D, K) be sites, and suppose we are given 
two functors 7: D — C and ¢: C — D with 7 left adjoint to ¢. Then 


(i) yog: C — D Sets” is flat [and a fortiori so is ayp = 
Ag: C > Sh(D, K)]; 
(ii) @ preserves covers iff x has the covering lifting property. 


Proof: (i) Notice first that the associated sheaf functor a commutes 


with tensor products; explicitly, for any functor A: C > Sets?” and 
any sheaf R on (C, J), 


a(R @c A) ~ R@c(aA), (1) 


where the tensor product R@c A on the left is taken in Sets?’”, 
while the one on the right is in Sh(D, AK). Indeed, this follows since 
the tensor product is defined as a coequalizer, and a~-being a left 
adjoint—preserves coequalizers. In particular, applying this to the func- 
tor yod: C5 Sets?” | we find that if yo@ is flat then so is A, = aoyod, 
by (1) and the left exactness of a. This explains the “a fortior?’ in (i) 
of the lemma. 

Also, for a presheaf R on C and an object D of D, there are natural 
isomorphisms 


(R@c(y °¢))(D) = R®ce D(D, ¢—) 
© R@®oD(rD, —) 
= R(xD). 


(The first isomorphism holds because colimits in Sets>”” are computed 
pointwise, the second by the given adjunction 7 4 @, the third since 
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representables are units for the tensor product.) Thus, there is a natural 
isomorphism 


— @c(y¢) = 1*: Sets™” — Sets?””, (2) 


where, as in §2, we write m* for the functor given by composition with 
a: D — C. But clearly z* is left exact; so y¢ is flat. 

(ii) (=) Assume that @ preserves covers, Let D be an object of 
D and let S € J(aD) be a covering sieve in C. Consider the unit 
np: D > ox(D) of the adjunction 7 4 ¢. By the assumption on 4, 
the sieve (¢(S)) generated by { ¢(u) | u € S} covers da(D), so by the 
stability axiom for Grothendieck topologies its pullback [see (3) below] 


nb (O(S)) = {9: D' > D| 3u: C > x(D) in S such that 
Np og factors through ¢(u) } 
covers D. But if an arrow g: D' — D has the property that np og 


factors through ¢(u), then by transposing along the adjunction as in (3) 
below z(g) must factor through u; so m(g) € S: 


a > o(C) me Dy exnttses *C 
| fe ro | (3) 
D—p— ¢n(D), n(D) n(D). 


Thus, 1(7%(¢(S))) C S; i.e., n&(6(S)) is a cover of D whose 7-image 
refines the given cover S of 7(D). This shows that 7 has the clp. 

(<=) Assume that 7 has the clp, and let the sieve R € J(C) be a 
cover of C' in C. Pulling back along the counit €: 7¢(C’) — C of the 
adjunction, we find that «#(R) is a cover of 7¢(C’). Hence, since m has 
the clp, there must exist a cover S of (C’) in D such that 1(S) C e#(R). 
By transposing along the adjunction [as in (3)], one readily finds that 
m(S) C e#R yields S C (¢(R)). Thus, (¢(R)) is a cover. This shows 
that ¢ preserves covers, and completes the proof of the lemma. 


By the results of §7, this lemma implies that an adjoint pair 
n: D@C : @ satisfying the equivalent conditions of Lemma 3(ii) 
will induce a geometric morphism f: Sh(D, K) — Sh(C, J); indeed 
Ag = aoyog: C = Sh(D, K) is flat by Lemma 3(i), and continuous 
since ¢ preserves covers (as in the first part of the proof of Theorem 2). 
For future reference, we describe the inverse and direct image functors 
explicitly, as in the following two theorems, which we will apply in this 
form in Chapter IX. 
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Theorem 4. Let (C,J) and (D, K) be sites, and let x: D = C 
and ¢: C — D be functors such that 1 is left adjoint to ¢. If 7 has 
the clp, or equivalently if @ preserves covers, then there is an induced 
geometric morphism f: Sh(D, K) — Sh(C, J), with inverse and direct 
image functors described, for sheaves F on (C, J) and G on (D, Kk), by 


f*(F) =a(F on), f.(G) = Gog. 
Proof: As before, consider the functor Ay = ay¢: C — Sh(D, Kk). 
As stated just above the theorem, A, is flat and continuous, and hence 


it induces a geometric morphism f: Sh(D, K) — Sh(C, J) with f* = 
— @c Ay and f, = Hom(Ay, — ). Thus, for a sheaf F' on (C, J), 


f'(F) =F @c Ay 


= F ®ca(y¢) 

=a(F@c(yd)) [by (1) above 

= a(r*(F)) [by (2) above] 
=a(Fon) (by definition of 7*). 


Also, for a sheaf G on (D, K), since the associated sheaf functor a is left 
adjoint to the inclusion of sheaves in presheaves, we obtain the following 
isomorphism in Sh(C, J): 


f.(G) = Homg, (Ag, G) = Homp(y¢, G) = Homp(D(—, ¢—),G), 


where Homp denotes morphisms of presheaves on D. So for an object 
C in C, by the Yoneda lemma, 


f.(G)(C) = Homp(D(— , 6C), G) & G(¢C). 


Thus f,(G) = Go @, as stated in the theorem. 


For completeness, we next observe that just the functor 7: D — C 
alone suffices to give a geometric morphism of sheaves, provided x has 
the covering lifting property. (But we have no simple description of the 
direct image functor in this case.) 


Theorem 5. Let 7: D > C be a functor having the covering lift- 
ing property. Then 7 induces a flat and continuous functor A,;: C > 
Sh(D, K), defined by A,(C) = ac C(x—,C), and hence a geometric 
morphism f: Sh(D, K) — Sh(C, J) with inverse image f*(F) © a( For) 
for any sheaf F on C (as in Theorem 4). 
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Proof: For the continuity of A,, consider an object D € D, a cover 
S of C, and any arrow g: 7D — C in C. Then by properties of covers 
g* (S) is a cover of t(D) in C. Then since x has the clp, there is a cover 
R of D such that each v: D' — D in R fits into a commutative square 


nD! ------: >C! 
“| I wEeES,veR, 


CD U, 


for some u € S. This result means that the following evident map W of 
a coproduct of presheaves on D 


U: I] C@-,c)-cr-,c) 
(u: C/3C)ES 


is “locally surjective” in the sense described in Corollaries 5 and 6 of 
SIII.7. Indeed, the diagram just before states that for each element 
in the codomain on the right (i.e., for each object D in D and each 
g: ™D — C) there is a cover R of the object D such that g7(r) is in 
the image of W for every r in the cover R. Then Corollary III.7.6 states 
that aV is an epimorphism of sheaves. By definition of continuity and 
of A, = aoC(x-, ~), this means that A, is continuous. Moreover, as 
before [cf. (1)], we have for a sheaf F on C 


Fc Ax = F @c(aC(n—, — )) 
=a(F @cC(r—, ~)); 


where the last tensor product is taken in the presheaf category Sets?’”. 
But for any object D in D, (F ®ce C(a—, —))(D) = F@cC(aD, -) = 
F (aD). So the functor ~ @®c A,;: Sh(C, J) > Sh(D, K) is isomorphic 


to the composite aox*: Sh(C, J) C Sets®” , Sets?” — Sh(D, K). 
Since a and z* are both left exact, this shows that A, is flat, and that 
the inverse image f* = ~ ®c A, of the resulting geometric morphism 
indeed sends a sheaf F' on C to a(F'o7), up to natural isomorphism. 


As a typical example, let T be a small category of topological spaces, 
closed under finite limits and open subspaces, and let X be a fixed object 
of T. There is a natural Grothendieck topology on T, whose basic covers 
are of the form 

{U; ———> T}, 


where {U;} is a cover of the space T by open subspaces of T, and T is 
any object from T (see §III.2, Example b). The same construction gives 
a Grothendieck topology on the slice category T/X. 
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Now consider the inclusion functor 
i: O(X)-—— T/X; (4) 


i sends an open set U C X to the inclusion map U — X, an object of 
T/X. Now O(X), with the usual open cover topology, is a site (the 
standard one) for the topos Sh(X) of sheaves on X. Clearly, the above 
functor i of sites is left exact, preserves covers, and has the covering 
lifting property. Consequently, according to Theorems 2 and 5, 7 induces 
two geometric morphisms in opposite directions 


Sh(X) == Sh(‘T/X). (5) 


Here, according to the construction of f in Theorem 2, f, sends a sheaf 
F on T/X into the sheaf f,(F) = Foi: O(X)°? — Sets on X, while 
according to Theorem 5, j* sends F' to j*(F) = aoF oi. But Foi = f,(F) 
is already a sheaf, so j*(F) = Foi. Thus, these are two successive 
adjunctions 

fAafap i. (6) 
It also follows that for the first adjunction the counit (jf)* = f*j* = 
f*f — 1 gives an arrow jf > 1 in Hom(Sh(T/X), Sh(T/X)), while 
the counit (fj), = faj» = j*j« — 1 of the second adjunction gives an 
arrow 1 > fj in Hom(Sh(X), Sh(X)) (the latter arrow 1 — fj is in fact 
an isomorphism—Exercise 11). 

These arrows jf — 1 and 1— fj between geometric morphisms may 
be thought of as homotopies. In this sense we have shown that Sh(X) 
and Sh(T/X) are “homotopy equivalent” topoi. (It follows immediately 
that they indeed have the same homotopy and cohomology groups, al- 
though we will not go into this here.) Notice also that Sh(T/X) is 
equivalent to the slice topos Sh(T)/y(X), where y(X) € Sh(T) is the 
representable sheaf corresponding to X (see Chapter III, Exercise 10). 
In the French school, one calls Sh(X) the petit (small) topos associated 
with the space X, and Sh(T)/y(X) the gros (large) topos associated 
with the same space. Thus the small and large topoi associated with 
X are homotopy equivalent. An analogous fact holds for the small and 
large topoi associated with a scheme X (for the étale topology, or the 
Zariski topology). 


Exercises 


1. For any object C in a category C let evc: Sets"’” — Sets be 
the functor which evaluates each presheaf P at C. Show that 
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evc has a right adjoint, sending each set X to the presheaf 
X x Hom(-—,C) and a left adjoint, X H XHo™(,—). Conclude 
that evc is the inverse image functor of an (essential) geometric 
morphism Sets — Sets©’”. 

2. For groups G and H, let Hom(G,H) be the category of homo- 
morphisms from G to H, where an arrow from the homomor- 
phism ¢: G > H to ~: G > H is an element h € H for which 
@ = h~'ph [ie., for all g € G, O(g) = hv(g)h“"]. Thus, if G 
and H are each viewed as one-object categories, Hom(G, #) is 
exactly the category of functors from G to H, with arrows the 
natural transformations of functors. 


(a) Show that the construction (3.10) of a geometric morphism 
BG — BH from a homomorphism G — H of groups 
extends to give a functor Hom(G, H)°? — Hom(BG, BH) 
which is full and faithful. 

(b) Show that any geometric morphism BG — BBH is iso- 
morphic to one induced in the above way from a group 
homomorphism G — H. Conclude that the functor 
Hom(G, H)°? — Hom(BG, BH) of part (a) is an equiv- 
alence of categories. 

(c) Conclude that every geometric morphism BG — BH is 
essential. 


3. (a) For a functor A: C — Sets show that ~ @q A: 
Sets®” —Sets is the left Kan extension of A along y—see 
§5(6). (For the definition of Kan extension, see [CWM, 
p. 236].) 

(Background) Regard a ring R with unit element as an 
(additive) category with one object and the elements of R 
as morphisms, while Ab is the category of abelian groups 
and Modg is the (additive) category of right R-modules. 
Then regarding R as a right R-module defines a func- 
tor z: R — Mod, analogous to the Yoneda functor y. 
Observe that a left R-module L is an additive functor 
R — Ab and prove that — ®r L: Modpg — Ab is the 
left Kan extension of E along z. 


(b 


= 


4. For a functor ¢: C — D between small categories, let 
¢: Sets®” = Sets?” be the induced essential geometric mor- 
phism, as in Theorem 2.2. Show that the functor ¢;: SetsC” — 
Sets?” is the left Kan extension of yo ¢: C = D— Sets?” 
along y: C > Sets?” 
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5. 


10. 


11. 


12. 
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For ¢: C > D show that the geometric morphism ¢: Sets®” — 
SetsP” is a surjection if every object in D is a retract of an 
object in C. (The converse is also true, but is harder to prove.) 


. (A generalization to € from Sets) Show that a functor ¢: C > 


D between internal categories in a topos € induces a geometric 
morphism €© — €P between the categories of C- and D-objects 
(as described in Theorem V.7.1). 


. Let G be a topological group while G® is the same group with the 


discrete topology and p: BG® = BG is the geometric morphism 
of §3(14). 


(a) Show that the following conditions are equivalent: (i) p* 
preserves all small products; (ii) the intersection of any 
family of open subgroups in G is again open; (iii) there 
exists a nontrivial normal open subgroup U C G such that 
the quotient homomorphism G — G/U induces an equiv- 
alence BG = B(G/U). 

(b) Use part (a) to conclude that for a continuous homomor- 
phism H — G between topological groups the induced 
geometric morphism BH — BG need not be essential. 


. Let f: F — E be a geometric morphism with f = poi: F — 


E; — E its surjection-embedding factorization. Show that if f is 
essential, then so is p. 


. For a complete Boolean algebra B, show that points of the topos 


of sheaves on B [cf. §I1I.2, Example (d)] are in bijective corre- 
spondence with atoms of B. Use this result to give an example 
of a nontrivial topos without points. 

Let N be the set of natural numbers with the usual ordering, 
and consider the topos SetsN of “sets through time” (§1.1). Ac- 
cording to Exercise 1, every natural number n gives a point 
Dn: Sets > SetsN where Dn* = evaluation at n € N. Show 
that Sets also has a “point at infinity” and that the category 
of points of SetsN is (equivalent to) the ordered set N U {oo}. 
For a topological space X, show that the small topos Sh(X) is a 
retract of the big one Sh(T/X), as asserted in the text (see §10). 
A nonprincipal ultrafilter U/ on a set I can be viewed as a filter of 
subobjects of 1 in the topos Sets’. Show that there is no geomet- 
ric morphism from the filter-quotient Sets’ /U/ (as constructed in 
Chapter V) into Sets. (Hint: Show that in Sets! /U the coprod- 
uct )7,c91 of a family of subobjects of 1 does not exist for all 
sets S.) 
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13. (a) For elementary topoi € and F, show that the product cat- 
egory € x F is again a topos. [Objects of € x F are pairs 
(E, F) where F is an object of € and F one of F, and 
similarly for arrows of € x F.] 
(b) For any other topos G, show that there is an equivalence 
of categories 


Hom(€,G) x Hom(F,G) —~— Hom(E x F,G) 


natural in all three arguments. (In other words, for topoi 
the product as categories yields the coproduct in the 2- 
category of topoi and geometric morphisms. ) 


14. (a) Similarly to Exercise 13(b) above, show that for small cat- 
egories C and D and topoi €, there is a natural equivalence 


of categories 


— 


Flat(C, £) x Elat(D, £) > Flat(C x D, €) 


(the equivalence sends a pair of functors A: C — € and 
B: D — € to the functor C x D — € given by (C, D) b 
A(C) x B(D)). 

Conclude that the presheaf topos Sets is the prod- 
uct of the presheaf topoi Sets©”” and Sets?” | in the sense 
that for any cocomplete topos €, there is a natural equiv- 
alence of categories 


(CxD)°P 


c 


Hom(€, Sets”) x Hom(E, Sets?” ) —>— Hom(€, Sets‘CxP)°"”), 


15. (a) Given two Grothendieck topologies J on C and J’ on D, 
let J x J’ be the smallest Grothendieck topology on the 
product category C x D such that for any C € C and Dé 
D, and for any covering sieves S € J(C’) and S’ € J'(D), 
the sieve 


{(f,9)|f € Sand ge S’} 


is a (J x J’)-covering sieve of (C, D). Show that the equiv- 
alence of Exercise 14(a) restricts to an equivalence of cat- 
egories 


ConFlat(C, €) x ConFlat(D, €) — ConFlat(C x D, €). 
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(b) Conclude that 


Hom(E, Sh(C, J)) x Hom(E, Sh(D, J’)) % Hom(€,Sh(C x D, (J x J’)). 


(Thus the 2-category of Grothendieck topoi has products.) 

16. Reorganize the material of §4 as a proof of the following. Any 
geometric morphism f: F — &€ determines a comonad G on fF, 
a topology j on €, and an equivalence of categories Fg = Sh; E. 
Moreover, there is a surjection F — Fg and an embedding 
Sh; € > € which, composed with the equivalence, yield the given 


f. 


VITl 
Classifying Topoi 


The idea of “classifying” geometric or algebraic structures or spaces 
by maps into a given space is familiar from topology. For example, 
for any abelian group 7 and any n, there is a classifying space K (7,7) 
for cohomology: for each space X, cohomology classes a € H"(X,7) 
correspond to (“are classified by”) maps X — K(z,n). After reviewing 
some of these topological examples in more detail, we introduce a similar 
notion of a classifying topos. Again, the idea is to classify structures 
over topoi by maps into one suitably constructed topos. For example, 
a topos R is said to be a classifying topos for commutative rings when 
for any topos € there is a natural equivalence between ring objects in 
E and geometric morphisms € — R. An application of the results on 
continuous filtering functors from the previous chapter will construct 
such a classifying topos 72; it will turn out to be the topos of set-valued 
functors on the familiar category of finitely presented commutative rings. 
This will follow from the fact that this category is “freely generated” by 
the polynomial ring Z[X], in a suitable sense to be formulated below 
(see Proposition 5.1). 

Instead of developing the general theory of classifying topoi, the main 
purpose of this chapter is to discuss a number of specific examples. The 
case of a classifying topos for principal G-bundles (“torsors”) is closely 
related to the topological examples and will be discussed first in §2. 
After having formally introduced the concept of a classifying topos (§3), 
we will present the especially simple classifying topos for an object, and 
then that for commutative rings. In §6, we will prove that the Zariski 
topos is a classifying topos for local rings. In §7, we will describe the 
category of simplicial sets while §8 will show that it is a classifying topos. 

The discussion of classifying topoi will be continued in Chapter X, 
where we will prove a general existence theorem, giving a construction 
of a classifying topos for any type of structure which can be suitably 
axiomatized (essentially, the use of negation and universal quantification 
in the axioms is restricted). 

Some indications of the many uses of classifying topoi are given in 
the exercises here and in the Epilogue. 
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1. Classifying Spaces in Topology 


The study of a classifying topos was motivated by examples from 
topology—the classifying spaces for cohomology, and for principal bun- 
dles for topological groups. The purpose of this section is to describe 
briefly some of these motivating examples from topology. 

For a topological space X the n-dimensional (singular) cohomology 
group H"(X,7) with coefficients in the abelian group 7 is defined for 
any natural number n. It is a covariant functor of 7 and a contravariant 
functor of X. Moreover, two continuous maps f, g: X — Y which are 
homotopic (f ~ g) induce the same group homomorphism in cohomology 


f* =9*: H"(Y,7) — H"(X,7), when f ~g. 
There is, in particular, a space AK(x,n) with cohomology 


H" (K(x, n), 7) = Hom(z, 7), (1) 
Wt lito, 

with a cohomology class 7, in dimension n corresponding under (1) to 
the identity group homomorphism x —» 7. It has the following prop- 
erty: For any n-dimensional cohomology class c, € H”(X,7) in any 
paracompact space X there is a continuous map f: X — K(z,n) with 
f*%n = Cn, and this map f is unique up to homotopy. When [X, Y], as 
usual, denotes the set of homotopy classes of (continuous) maps X — Y 
this means that there is a bijection 


) = [X, K(x, n)], (2) 
trio f:X — K(x,n), f*yn = Cn. (3) 


In other words, every n-dimensional cohomology class in any space X 
arises by “pulling back” the “universal” cohomology class 7, along a 
map X > K(z,n), unique up to homotopy. The bijection (2) is natural 
in X and in 7, and this bijection determines the universal class as the 
preimage of the identity map of K(7,n). 

The space K(z,n) is called the classifying space for cohomology or 
the Hilenberg-Mac Lane space of degree n and coefficients 7. It may be 
constructed as a CW-complex in which 7 in dimension n is the only 
nontrivial homotopy group. Alternatively, it is a simplicial set whose 
q-simplices, for each dimension g, are the elements of Z"(A%,7): the 
n-dimensional cocycles on the qg-dimensional simplex A?. For n = 1 and 
the group Z of integers, the space K(Z, 1) is the circle St, 

This classifying space may be used to construct cohomology oper- 
ations. Thus, if 8,4; € H"+*(K(x,n),7) is any cohomology class of 
dimension n +i in the space K(7,n), then each n-dimensional class cn 
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in a space X determines f as in (3) and hence f*@n4i € H"+*(X,z). 
One thus obtains an “operation” of degree i, 


Ga et" Bass H”(X,m) > H"*"(X,z), (4) 


natural in X. For example, when 7 = Ze, the resulting operations are 
those of the well-known Steenrod algebra modulo 2. 

Another example is given by the classifying space for G-bundles, for 
G a topological group. We start with the example of the Grassmann 
manifold Ggn, where points are the d-dimensional linear subspaces of 
an n-dimensional (real) vector space, with the evident topology on the 
“points”. (For example, G,,, is the manifold of lines through the origin 
in n-space, so is just the real projective space of dimension n ~ 1.) 
Consider also the Stiefel manifold Sz, whose “points” are all ordered 
d-frames (v1,...,vq) of d orthonormal vectors in R”. Since each such 
frame spans a d-dimensional subspace, this gives an evident projection 


p: San - Gan- (5) 


Moreover, two such frames (21,..., Vg) and (w1,...,Wq) span the same 
subspace in R” iff there is an orthogonal transformation carrying the 
first frame into the second. In other words, the d-dimensional orthogo- 
nal group acts transitively and continuously on each fiber (each inverse 
image of a point) of the map p of (5). 

This projection (5) is an often studied example of a principal bundle 
over the Grassmannian Gq p.- 

For any topological group G, a principal G-bundle over the topolog- 
ical space X is a space E equipped with a continuous map 


p: EaxX 
together with a continuous left action of G on EF 


pi Gx EB, u(y) =9°Y, (6) 


for g € G, y € E, which preserves the fibers [i-e., is a map over X in 
the sense that pu(g,y) = py for all g € G and y € E}. Moreover, one 
requires that p: E -> X be locally the projection from a product, in the 
sense that there exists a covering of X by a family U, of open sets and 
for each index a a homeomorphism (a so-called “local trivialization” ) 


ba: GX Uy —~ p*(Ug) (7) 
which is a map over U, and which respects the G-action, in that 


Poalg, x) = 2, ba(gh, Xx) = Lg, dalh, r)) (8) 
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for all g € G, h € G, and x € U,. From this definition it follows that p 
must be a surjection and that the action of g on each fiber p-1(x) must 
be both free: 


L(g, y) = y implies g = e (= the unit of G), (9) 
and transitive: 
For y, y' € p-'(z) there exists g in G with gy =y/’. (10) 


Under these two conditions (9) and (10) one says that the action of G 
on each fiber is a principal action. 

A principal G-bundle may thus be pictured in terms of the covering 
{U.} as a collection of products G x U,, pasted together on the overlaps 
U, 1 Ug, much as in the description of a smooth manifold in terms 
of charts from an atlas (Chapter II); as in that case the choice of a 
particular covering U, is irrelevant. Detailed formulas for the piecing 
together of the products G x U, may be found in one of the standard 
texts on fiber bundles ([Steenrod], [Husemoller]). In particular, the 
real (or complex) Stiefel manifolds described above constitute such a 
bundle. The tangent bundle for a differentiable manifold is a principal 
bundle for the (real) full linear group. A regular covering map p: E > X 
of topological spaces, with # connected and X pathwise connected and 
locally simply connected in the large, provides another example. Recall 
that regularity means that the image under p of the fundamental group 
m1 (E) is a normal subgroup N of 7,(X). A standard construction (using 
covering transformations) shows that the quotient group G = 1,(X)/N 
acts on F and makes F a principal bundle for the group G. In particular, 
the universal covering map of such an X is a principal bundle for the 
group 7(X). 

If p: EF — X and q: F — X are two principal G-bundles for the 
same group G over the same space X, a map p — g of bundles is a 
continuous map f: E — F of spaces over X which respects the action 
of G. In other words, for all y € EF and g € G, 


af (y) = Ply); f(u(9,y)) = #9, fy). (11) 


Such a map f is necessarily a homeomorphism. Thus it is reasonable to 
call two principal G-bundles over the same space X equivalent iff there 
is in this sense a map f from one bundle to the other. 

If u: Y — X is a continuous map into the base space X of a principal 
G-bundle p: E — X then the pullback ze, along wu, as in 


Exx Y ——— E 


Lp 
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is a principal bundle over Y. Specifically, given local trivializations 
a: Gx U, = p Us, of E for a cover Uy of X will yield local trivi- 
alizations u*¢q given for the covering u-!U, of Y by 


U*ba(g,y) = (dalg, Uy), y) € m2 wu Ua 


forgé Gandy €u!Ug. 

The remarkable fact now is that for each topological group G and 
for each space Y, satisfying some mild conditions, there is a universal 
principal G-bundle, written 


pg: EG > BG, (13) 


such that every principal G-bundle over Y is equivalent to a pullback as 
in (12) of this universal bundle along some continuous map u: Y - BG. 
This map u is unique up to homotopy; in other words, the homotopy 
classes [Y, BG] correspond—naturally in the space Y—to equivalence 
classes of principal G-bundles over Y. If kg(Y) is the set of those equiv- 
alence classes, the result may be stated as a natural bijection 


cy: ke(Y) — [Y, BG]. (14) 


The general construction of this universal G-bundle is given in the 
texts mentioned above. (See also [Milnor], [Segal].) We note that in 
case the space Y is a polyhedron of sufficiently small dimension, then the 
Og-bundle given by the Stiefel manifold in (5) is the universal bundle 
for the real orthogonal group Og. 
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When the group G is discrete the corresponding classifying space 
for principal G-bundles, as discussed in §1, has a more topos-theoretic 
aspect. Indeed, given a principal G-bundle p: EF — X, take an open 
cover {U,} of the space X for which there are local trivializations 
da: G x U, = p~'U,q for each index a. Since the group G is discrete, 
this homeomorphism ¢, can also be written as a homeomorphic map 


ba: )/Ua—> P'(Ua) (1) 


geG 


from the disjoint sum >> U, of G copies of U,. Thus, it follows that 
the map p: E — X is a covering map in the sense described in §II.4: 
a map p: E — X for which each point x € X is contained in an open 
neighborhood U for which the inverse image p-'(U) is homeomorphic 
(over U) to a disjoint sum of copies of U. Since a covering map is 
a fortiori étale, this proves one direction of 
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Theorem 1. A principal G-bundle on X for a discrete group G 
is (equivalent to) an étale map p: E — X with a continuous action 
G x E > E over X such that 


(i) For each point x € X the fiber E, = p~'({zx}) is nonempty; 
(ii) The action map G x E, — E, on each fiber E, is both free and 
transitive. 


Conversely, start with an étale map p with these two properties. 
Since it is étale, there is for each point e € E a neighborhood U of pe = x 
and an open neighborhood V of e such that p induces a homeomorphism 
V — U. But each action map g: E — E has a two-sided inverse given 
by the group inverse g~!. Therefore, gV is an open neighborhood of 
ge with p(gV) = U. Moreover p~'(U) = U, gV since the action on all 
the fibers is transitive, while gv M hV = @ whenever g # h because the 
action is free. Hence the homeomorphisms gV ~ U combine to yield 
a local trivialization G x U © p"'U of p. Therefore, p is a principal 
G-bundle. 

This leads to the introduction of a definition of a “torsor”, 


Definition 2. A G-torsor over the space X for a discrete group G 
is an étale map E — X with the properties (i) and (ii) above. 


Here étale maps to X may be replaced by sheaves. Each étale map 
p: E — X is equivalent, as in §I1.6, to a sheaf F of sets on X; to 
wit, the sheaf F for which each set F(U) is the set of all cross-sections 
s: U — E of p over the open set U. Moreover, the action G x E — E 
when composed with cross-sections yields a left action of the group G 
on the sheaf F' in the following sense: A collection 


pu: Gx F(U) —— F(U), U open in X, (2) 


of left actions of the group G on the sets F'(U) which are natural in U, in 
that each inclusion p: V — U of open sets gives a commutative diagram 


Q x F(U) #2 F(U) 
oxF0)| F(p) (3) 


Gx F(V) = F(v) 
of maps of sets. Now each such action of the discrete group G on the 
sheaf F induces a corresponding (left) action yz of G on each stalk F, 
of the sheaf. Explicitly, the stalk F, at a point x is the colimit of the 
sets F(U) where U ranges over the open neighborhoods of x in X, so 
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that the natural action maps py of (3) induce maps pz: G x Fy — F, 
on each colimit 


be: GX lim F(U) —— lim F(U) = Fe. (4) 
zeU zeU 


Thus, the definition of a torsor may be restated in terms of sheaves as 
follows. 


Corollary 3. A G-torsor over a space X is a sheaf F of sets on X 
together with a natural (left) action of G on F, as in (2), such that 


(i) the stalk F, at each point x € X is nonempty; 
(ii) each induced action j1z of (4) is free and transitive on Fy. 


The action map (2) transposed gives a map G > F(U)F) of sets 
which sends each g € G to a map 0y: F(U) — F(U); by the natu- 
rality condition (3) these maps @y constitute a natural transformation 
0: F — F of functors. Since Hom(F, F) is the set of all such natural 
transformations the maps py together give a function 


fi: G —+ Hom(F, F), (5) 


as an action of G this is a morphism of monoids. This means that the 
two diagrams 


1—#-,Hom(FF) Gx G—*.Hom(F, F) x Hom(F,F) 


| A “| [eons 


G, G mea eres Hom(P, F) 

(6) 
commute, where e designates the unit element and m the multiplication 
of the group G. Thus (5) and (6) give a definition of the action of the 
(discrete) group G on the sheaf F’. 

With this, the above characterization of a G-torsor can be restated 
without explicitly using the stalks of the sheaf. For this, recall from 
§VII.1(7) the geometric morphism given by the global sections functor 
T as 

A: Sets ——— $Sh(X) :T, (7) 


where the constant sheaf functor A is left adjoint to [ and left exact. 
Moreover, any sheaf F” has global cross-sections ['(F’) = F’(X). In 
particular, for the exponential sheaf FF as described in equation SIT.8(2), 


this gives 
T(FF) = F(X) = Hon(F, F). 
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Therefore the monoid morphism 7Z of (5) is i: G > I'(F*) or, by trans- 
position, a map A(G) — F¥ of sheaves or, by transposition again, a 
map 

p: A(G) x F ——— F, (8) 


The commutative diagrams (6) for the monoid homomorphism become 
by transposition along the adjunction between I and A the following 
commutative diagrams of sheaves: 


A(G) x F, MO). 


BL 

Here we have written e and m for the maps of sheaves induced by the 
respective structure maps e: 1 — G and m: G x G — G of the group 
G. Moreover, since A preserves finite limits, the constant sheaf A(G) 
with the arrows e and m is itself a group-object in the category Sh(X) 
of sheaves. Thus the commutativity of (9) states that the map ji of (8) 
satisfies the standard requirement for an action of the group object AG 
on the sheaf F in the category Sh(X). In other words, an action (2), (3) 
of a discrete group G on a sheaf F' is the same thing as an action of the 
group object A(G) on the sheaf F. 


Lemma 4. For a discrete group G, a G-torsor on a space X is an 
action 4 of G on a sheaf F over X, as in (3), for which 


(i!) F — 1 is an epimorphism of sheaves; 
(ii!) the action ji: A(G) x F — F of (8) induces an isomorphism 
(fi, 72): A(G) x F —, F x F of sheaves. 


Proof: The map F’ — 1 of sheaves, considered as a map of étale 
spaces over X, is epi iff each induced map F, — 1, = 1 of stalks is a 
surjection of sets. Thus, condition (i') here is equivalent to condition (i) 
in the Definition 2 of a G-torsor. Moreover, the map (j1, 72): A(G)xF > 
F x F induces at each point z € X a map (ji, 72) of stalks, as in the 
top row of the commutative diagram 


(Ag x FP), 2s (F x Fe 
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The identifications made by the vertical arrows of this diagram yield the 
dotted map G x F, — F, x F,, readily identified as (j1,, 72), where 
[tz is the action map on the stalk F,, as described in (4), There- 
fore, the map of sheaves (ji,72): A(G) x F — F x F is an isomor- 
phism iff each map (ji, 72), of stalks is, or iff for each c € X the map 
(2,72): Gx F, — F, x F, is an isomorphism. The latter isomorphism 
states exactly that the action of G on the stalk F, is both free and 
transitive. Therefore, condition (ii’) of the lemma is equivalent to the 
corresponding condition (ii) in the definition of a torsor. This proves 
the lemma. 


With this terminology and result, Theorem 1 may be restated as 


Theorem 5. For any space X and any discrete group G there is 
a natural bijective correspondence between principal G-bundles on X 
and G-torsors over X. This correspondence associates each G-torsor, 
regarded as a sheaf F' with G-action as in Lemma 4, to the corresponding 
étale space over X with the corresponding G-action on the stalks. 


Now Lemma 4 describes G-torsors in Sh(X) purely in terms of the 
geometric morphism 7: Sh(X) — Sets with y, = [ and 7* = A, as 
in (7). This description makes sense for any topos € equipped with a 
geometric morphism 7: € — Sets—and hence for any cocomplete topos 
€; in particular for any Grothendieck topos. This suggests the following 


Definition 6. Let G be a discrete group (in Sets) while y: E > 
Sets is a topos over Sets. A G-torsor over E is an object T of € equipped 
with a left action u: y*(G) xT — T by the group object y*(G) for which 


(i) the canonical map T — 1 is an epimorphism; 
(ii) the action 4 induces an isomorphism in € 


(ut, 72): y*(G) x T —T xT. (11) 


In this connection, recall that the inverse image functor -y* preserves 
finite limits, so that y*(G) is indeed a group-object in €. Thus, a G- 
torsor over € is a special kind of object in the topos £7 (©) of objects 
with a y*(G) action, as defined in §V.6. 

We have shown that every principal G-bundle over a space X can be 
viewed as a G-torsor in the sheaf topos Sh(X). A quite different example 
of a torsor is present in the topos BG of all right G-sets (which is to 
be sharply distinguished from the classifying space BG discussed in the 
previous section). Namely, there is a canonical G-torsor Ug over BG, 
arising from the fact that G acts on itself both from the left and from 
the right. Thus, let Ug be G itself viewed as a right G-set, that is, as an 
object of BG, with right action Ug x G — Ug given by multiplication. 
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To see that Ug has the structure of a G-torsor over BG, recall first 
that the inverse image of the geometric morphism 7: BG — Sets is the 
functor -y*: Sets — BG which sends a set S to the same set S with the 
trivial right G-action (so that s-g = s for all s € S and g € G). Now 
the multiplication of G gives a map 


pi y*(G)xUg Ue; (gh) = gh. (12) 


This is an arrow in BG, for G acts trivially on y*(G) and by right 
multiplication on Ug, and for g, h, kin G the identity u(g, hk) = p(g, h)k 
holds. Clearly j: defines a left action of y*(G) on Ug, for which the map 
(ut, 72): y*(G) x Ug — Ug x Ug is an isomorphism. [Indeed, if we 
forget the right G-action involved, then this is just the isomorphism 
G x G > Gx G sending a pair (g, h) to (gh, h).] Thus, by Definition 6, 
(Ug, ») is a G-torsor in BG. 

We claim that this torsor Ug is in some sense the universal G-torsor, 
or that the topos BG “classifies” G-torsors, analogous to the fact that 
the space BG classifies principal G-bundles. To state this in a more 
precise way, first observe that, for any topos € over Sets, the G-torsors 
over € form a category 

Tor(E,G) (13) 


in a natural way: for G-torsors (T,y) and (T",') over E, a map 
f: T,n) — (T’,p’) is simply an arrow f: T — T’ of E which re- 
spects the left action by y*(G). [So Tor(€,G) is a full subcategory 
of €Y(@) ] It is not difficult to see that any such map between G- 
torsors over € is necessarily an isomorphism (Exercise 3). The cat- 
egory Tor(€,G) depends functorially both on € and on G. The de- 
pendence on G will not be discussed here. Tor(€,G) depends func- 
torially on € in the following way: Let f: F — E be a geometric 
morphism of topoi over Sets; so if we denote the unique geometric 
morphisms into Sets by ye: E — Sets and yz: F — Sets, then 
by this uniqueness f must satisfy yeo f = yr. If (T,p) is a G- 
torsor in €, then by applying the inverse image functor f* we obtain 
an object f*(T) of F; moreover, since f* is left exact, f*(T) comes 
equipped with an f*ye*(G)-action f*(u): f*ye*(G) x f*(T) > f*(T) 
such that (f*(u), 72): f*ye*(G) x f*(T) — f*(T) x f*(T) is an iso- 
morphism. Since f*ye*(G) = yz*(G), it follows that these definitions 
make (f*(T), f*()) a G-torsor over F. Clearly, f* sends maps between 
G-torsors over € to maps between G-torsors over ¥, and hence yields a 
functor 


f*: Tor(E,G) ——> Tor(F,G) (from f:F—-€). = (14) 


The claimed universality of the G-torsor Ug over BG can now be stated 
in the following way. 
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Theorem 7. For each discrete group G and each topos E over Sets 
there is an equivalence of categories 


Hom(€, BG) —>— Tor(E, G) (15) 
between geometric morphisms € — BG and G-torsors over E; moreover 
this equivalence is natural in E€, in the sense that, for any geometric 
morphism f: F — E over Sets, the diagram 

Hom(€, BG) —~—> Tor(€, G) 


Hom( 1965] | : (16) 


Hom(F, BG) —~— Tor(F, G) 


commutes (up to natural isomorphism). 


Notice that, by naturality, the equivalence (15) is completely deter- 
mined by what it does to the identity on BG (for the case € = BG). It 
will appear from the proof that, under the equivalence (15), id: BG > 
BG corresponds to the torsor Ug over BG. By naturality, it then fol- 
lows that, under (15), an arbitrary geometric morphism f: € ~ BG 
corresponds to the G-torsor f*(Ug). Thus, Theorem 7 states that (up 
to isomorphism) any G-torsor over any topos € comes from applying 
the inverse image functor of a suitable geometric morphism f: E — BG 
to the torsor Ug over BG. It is in this sense that Ug is the universal 
G-torsor. 


Proof of Theorem 7: The starting point is the Theorem VII.7.2 
stating that, for any small category C and for any topos €, there is an 
equivalence of categories 


Flat(C, €)  Hom(E, Sets”). (17) 


Under this equivalence, a geometric morphism f: E > Sets” corre- 
sponds to the flat functor f*oy: C Sets©” — €. In particular, for 
the case € = Sets©’”, the identity morphism on Sets©’” corresponds 
to the Yoneda embedding y: C > Sets©”. 

In particular, a group G can be viewed as a category C with one 
object (call it +), while geometric morphisms € = BG = Sets®”’ cor- 
respond to flat functors G — € by (17). To prove the theorem about 
torsors, it thus suffices to show that there is a natural equivalence of 
categories between flat functors G — & and G-torsors over E. 

Giving a functor A from a group G to Sets amounts to giving a set 
T = A(*) and a left action of G on this set T. A corresponding result 
holds for any topos €, over Sets by a geometric morphism y = ye: E > 
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Sets. Indeed, a functor A: G > € determines an object T = A(x) of E, 
while the effect of A on arrows g € G is a homomorphism of monoids 
G — Hom¢(T,T) © Home (1,T7) & Home(y*1,T7) 
= Homsets(1, y«(T7)) = y(T7); 
by successive adjunction, this amounts to maps 
Y@oT, YG xTot 


in €. The fact that G — Hom(T,T) is a morphism of monoids means 
simply that the corresponding map y*(G) x T — T in € is an action 
[much as in the equivalence between the commutativity of (6) and of (9) 
above]. Therefore there is a bijection 


Functors(G, €) © €7 @) (18) 


which is an isomorphism of categories and is natural in €, in the sense 
that each geometric morphism f: F — € yields a commutative diagram 


~ 


Functors(G,£€) === £7") 


compose with r| |r (19) 


Functors(G, F) ———= Fre" (9), 


Here the map f* on the right is defined in much the same way as the 
map f* of (14) between torsors. For this isomorphism (18) we wish to 
show that flat functors on the left correspond to torsors on the right. 

For the special case where € = BG, notice that the identity BG > 
BG corresponds under (17) with C = G to the Yoneda embedding 
y: G — BG, and then under (18) to the left y*(G)-object y(*) = Ug 
in BG. Now suppose A: G — € is any flat functor. By (17), up to 
isomorphism A is of the form f* oy: G + BG — €, for some geometric 
morphism f: € — BG. Hence the left +*(G)-object corresponding to 
A under (18) is isomorphic to f*(y(*)) = f*(Ug) [by naturality as in 
(19)|. But Ug is a G-torsor in BG, so f*(Ug) is a G-torsor in €. This 
proves that, under (18), a flat functor G > € yields a G-torsor in €. 

To complete the proof, it remains to show that if (T, 4) is a G-torsor 
in €, then the corresponding functor A: G > € of (18) is flat. This 
functor A is described explicitly on objects by A(*) = T and on arrows 
g by A(g) = w(g): T > T, as in 
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To see that A is flat, we apply Theorem VII.9.1 and Definition VII.8.1 
and verify the three conditions there for a filtering functor. Condition (i) 
states that A(*) = T — 1 is epi, which is part of the definition of a G- 
torsor. Condition (ii) states that the family of arrows 


{(#(g),u(h)): T> Tx Tha nec (20) 


is an epimorphic family in €. To see that this is the case, first recall that 
y* and — x T are both left adjoints, so both preserve the coproduct 
G = Vxeq1 in Sets. Thus 7*(G) x T is ¥°,¢qT. This leads to a 
commutative square 


Se T === ¥7*(G) xT 
keG 
| | (4,72) (21) 
2o Gammn 
(gh)eG 


where 7 sends the summand with index k € G to the summand with 
index (k,e) via the identity map T — T. Since the right-hand map in 
(21) is an isomorphism, the bottom map must be epi, which means that 
(20) is indeed an epimorphic family. 

Finally, condition (iii) for a filtering functor states in this case that, 
for any two elements g, h € G with g 4 h, the equalizer of 


ptf{er  g#h, (22) 
m7 


is the initial object 0. (Indeed, when the empty family of arrows to an 
object E is epimorphic, that object E must be 0.) By precomposing 
the two arrows in (22) with y(h~*), it will suffice to show that, for any 
g € Gwith g #e, the equalizer of (g) and id: TT is 0. But consider 
the diagram 


0 ——_——_ 7*(1) x T ——— T 


i ae 


(1) xT *(G)x TT xT (23) 


> y 
y"(g)x1 (4,72) 


i 


T (ula) td) xT. 


Here A on the right is the diagonal map, so the upper square on the right 
is evidently a pullback, while the upper square on the left is a pullback 
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since 7*(G) xT = one T and coproducts in a topos are disjoint (recall 
that g # e, by assumption). By the vertical isomorphism, it follows 
that the outer rectangle in (23) is a pullback; that is, the equalizer of 
(u(g),id): T 3 T is zero, as was to be shown. 

This completes the proof of the theorem. 


3. Classifying Topoi 
There is an evident analogy between the equivalence of categories 


Tor(E,G) = Hom(€, BG) (1) 
of Theorem 2.7, and the bijection (14) of §1 for G-bundles: 
kg(Y) & [Y, BG]. (2) 


The first gives for each G-torsor over a topos € a classifying geometric 
morphism from € into the topos BG, while the second gives for each 
principal G-bundle over a space Y a classifying map Y — BG to the 
classifying space. Moreover, the identity BG — BG on the right of 
(1) corresponds to the “universal” G-torsor Ug over BG, and any other 
G-torsor is an inverse image of this universal one, as explained below 
Theorem 2.7. Similarly, the identity BG — BG on the right of (2) 
corresponds to the universal principal G-bundle over BG {cf. (1.13)], 
and any other principal G-bundle is a pullback of this universal one. It 
is thus natural to call the topos BG a classifying topos for G-torsors, 
just as the space BG is called a classifying space for G-bundles. 

In general, suppose we have a notion of a “structure” of a certain 
kind, such that for each topos € (or: for each topos € over Sets) there is 
a category of such structures in €. One says that a topos B is a classifying 
topos for these structures if there is an equivalence of categories between 
geometric morphisms € — B and such structures in €, and if, moreover, 
this equivalence is natural in €. 

A simple example of a possible type of structure is that of a ring- 
object in a topos €. We will see in §5 below that there exists a classifying 
topos B(rings) for ring-objects, in the sense that the category of ring 
objects in a topos € is equivalent to the category of geometric morphisms 
E — Bi(rings). 

To be more specific, let us collect the axioms for the structures of a 
certain kind (e.g., G-torsors or rings) into a “theory” T, and call these 
structures M in a category € the T-structures or the “T-models” in 
E. Let Mod(E, 7) denote the set of all those models in €. Since each 
notion of a “structure” will also determine the notion of a morphism of 
that structure, with the usual composition of morphisms, this Mod(€,T) 
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will be a category. Moreover, we will suppose that the inverse image 
morphism of a geometric morphism f: F — € (over Sets) will carry 
any T-structure M in € to a T-structure f*M in F. Thus, Mod(€,T) 
for each such T will be a contravariant functor of €. These conditions 
clearly apply for the familiar algebraic theories T, such as the theory 
of rings. We will not pause here to examine more general notions of a 
“theory”. 

A classifying topos for T-models is then a topos B(T) over Sets with 
the property that for every cocomplete topos € there is an equivalence 
of categories 

ce: Mod(€,T) —~+ Hom(E, B(T)) (3) 


which is natural in €. This naturality, of course, means for each geomet- 
ric morphism f that under the equivalence the operation M +> f*(M) 
corresponds to composition with f; i-e., 


cr(f*M) = ce(M) 0 f. (4) 
In other words, each diagram 
Mod(€, T) —z—> Hom(€, B(T)) 
r| sets) (5) 
Mod(F¥,T) —>— Hom(F, B(T)) 


commutes up to natural isomorphism. As in the case of G-torsors, it 
follows that there exists a universal T-model (also called a generic T- 
model) Ur in B(T), namely, the model corresponding to the identity on 
B(T) under the equivalence (3), for the special case where € = B(T). 
This universal T-model Ur has the following characteristic property: for 
any topos € and any T-model in €, there exists a geometric morphism 
f: € — B(T) (unique up to isomorphism), such that M = f*(Ur). 

For example, for a given site (C, J), the previous chapter studied 
continuous filtering functors from C into a (cocomplete) topos €. For C 
fixed, such a continuous filtering functor A: C — € may be considered 
as a type of “structure” in €. Corollary VII.9.2 describing geometric 
morphisms to sheaf categories states that (among cocomplete topoi) the 
topos Sh(C, J) of sheaves on the given site (C, J) is a classifying topos 
for continuous filtering functors on C. This result is in some sense the 
“basic theorem” concerning classifying topoi and many other results are 
special instances of it. For example, Theorem 2.7 above states that the 
topos BG of right G-sets is a classifying topos for G-torsors. The proof 
of this theorem proceeded from the special case of this “basic theorem”, 
stating that BG classifies filtering functors on the one-object category 
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G, by identifying such functors G — € into a topos € with G-torsors in 
E 


The next sections will give a number of other examples of classify- 
ing topoi, using much the same strategy: Starting from the fact that 
Sh(C, J) classifies continuous filtering functors on C, we next describe 
such functors on C for special sites (C, J) in more familiar terms. This 
moreover will illustrate the flexible use of the notion of a site. 


4. The Object Classifier 


The simplest, and at the same time perhaps most basic example of a 
classifying topos is the so-called object classifier. This is a Grothendieck 
topos (usually denoted by S[U]), with the property: for any cocomplete 
topos € there is an equivalence between objects of € and geometric mor- 
phisms € — S[U]. In other words, there is an equivalence of categories, 
natural in €, 

ce: E —~— Hom(E, S[U}), (1) 


sending an object E of € toits “characteristic” geometric morphism € — 
S[U]. As in the previous section, the identity S[U] — S[U] corresponds 
to a “universal” object of S[U], which is usually denoted by U. The 
naturality of (1) then implies that 


ce(E)"(U)SE (2) 


and that c¢(£) is the unique geometric morphism (up to isomorphism) 
with this property. 

The notation S[U] and the universal property expressed by (1) and 
(2) are reminiscent of ring theory. There each “ground ring” k yields the 
polynomial ring k[z], while for any k-algebra A there is an isomorphism 


ca: A—~— Hom, (k[z], A), 


like (1), given for each a € A by ca(a)(x) = a, just as in (2). 

Now why does such a topos S[U] exist? The reason is simple enough, 
for if C is any small category with finite limits, then by the results in 
Chapter VII, geometric morphisms f: E — Sets©” correspond to left- 
exact functors A: C — €. So if we let C be the category with finite 
limits which is “freely generated” by a single object G, then Sets©” 
has the universal property (1) required of S[U]. We shall now describe 
this free category C explicitly in Lemma 2 below. But first, we consider 
the dual case, with finite limits replaced by finite colimits. 

For any two categories A and B with finite colimits, let us write 
Rex(A, B) for the category of right-exact functors from A to B (i.e., 
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those functors which commute with finite colimits), and natural trans- 
formations between them. A category ¥ with finite colimits is said to be 
freely generated by a specific object G € F if for any other category B 
with finite colimits the evaluation of a functor at G yields an equivalence 
of categories 

evg: Rex(F, B) —~— B, (3) 
which is natural in B. As usual, such a free category Ff, if it exists, is 


unique up to an equivalence of categories. 


Lemma 1. The category Fin of finite sets is the free category with 
finite colimits generated by the object G = {*} (a singleton set). 


Proof: Notice first that if a functor F: Fin — B into a category B 
preserves finite coproducts, it preserves all finite colimits. Indeed, to see 
that F preserves coequalizers, consider in Fin any coequalizer of finite 
sets 

ime of ne a3 (4) 
Here each set S$ in Fin is a coproduct |],-¢ G of one-point sets. Since 
the functor F preserves finite coproducts, it sends the coequalizer (4) to 
the following diagram in B, where B = F'(G): 


[J 2— |]s——- [2 (5) 
ses teT reR 


But by the universal property of coproducts, this diagram is a coequal- 
izer in B iff, for any object X € B, the diagram 


Hom(R, B(B, X)) ——— Hom(T, B(B, X)) => Hom(S, B(B, X)) 


is an equalizer of sets. But this is indeed the case because (4) is a 
coequalizer. 

Now let B be any category with finite colimits, and consider the 
evaluation functor 


evg: Rex(Fin, B) —— B, (6) 


which sends a right exact functor F': Fin > B to its value F(G) in B on 
the one-point set G. There is also a functor in the opposite direction, 


¢: B —— Rex(Fin, B), (7) 


defined as follows. For an object B € B, ¢(B) is the functor which sends 
a finite set S € Fin to the coproduct 


o(B)(S) =] ]B 


sES 
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of S copies of B. For an arrow u: S > T in the category Fin of finite 
sets, ¢(B)(u): I],<5 B > [<7 B is the unique map for which all the 


diagrams of the form 
I] B $(B)(u) al B 


ses teT 
| [rm 
B= B 


commute (for each s € S); here 7, and 7, (,) denote the coproduct in- 
clusions. The functor ¢(B) thus defined evidently preserves finite co- 
products, hence preserves all finite colimits as observed above. In the 
evident way, ¢(B) is a functor of B. 

Now the composite evg o¢: B — B is clearly isomorphic to the iden- 
tity functor. Also, for any right exact functor F’: Fin — B, there is a 
natural isomorphism ¢ 0 evg(F’) = ¢(F(G)) & F, precisely because F' 
commutes with finite coproducts and any finite set S has S = [], G. 
This proves the lemma. 


Since a left exact functor from Fin®? into a category B with finite 
limits is the same thing as a right exact functor Fin — B°P, Lemma 1 
yields: 


Lemma 2. The category Fin? is the free category with finite limits 
generated by the object G = {x}. 


In other words, for any category € with finite limits, there is an 
equivalence between left exact functors Fin°? — € and objects of €, 
again given by evaluation at the one-point G. In particular, when € is 
a cocomplete topos, we may apply Corollary VII.9.4 to the special case 
where the site (C, J) is the category C = Fin®? equipped with the trivial 
Grothendieck topology (only maximal sieves cover). Then the topos 
Sh(C, J) of sheaves is the whole functor category Sets” = Sets™. 
Thus, VII.9.4 yields an equivalence between geometric morphisms € > 
Sets and left exact functors Fin°? — €: 


Hom(€, Sets") = Lex(Fin®, €), (8) 

and hence by Lemma 2 an equivalence 
Hom(E, Sets™2) —~— €. (9) 
For a geometric morphism f: € > Sets“, this equivalence (9) sends f 


first [by the equivalence (8) above from VII.9.4] to the left exact functor 


f*toy: Fin? — Sets" —. €, and then (by the equivalence of Lemma 2) 
to the object f*y(G) of €. This shows that the topos Sets! is the 
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desired object classifier. From now on we will also denote this topos 
Sets™™ by S[U]. 

What is the universal object U of Sets“? To find this object 
U, we take € = Sets™® and apply the equivalence (9) to the iden- 
tity map Sets®2 — Sets™". The resulting object on the right of 
(9) is then y(G) € Sets™". But since G is a singleton set, y(G) = 
Fin(G, — ): Fin > Sets is essentially the inclusion functor. 

For the record, we state: 


Theorem 3. The topos S[U] = Sets®™® is the object classifier, with 
universal object U € Sets™2 the inclusion functor Fin > Sets. So for 
any cocomplete topos €, there is an equivalence of categories, natural in 

2 


Hom(E,S[U]) “> €, fr f*(U). (10) 


It is not difficult to describe explicitly the quasi-inverse of the functor 
(10), ie., the functor cg of (1); see Exercise 4. 


5. The Classifying Topos for Rings 


In this section and the next, “ring” will always mean commutative 
ring with unit element. Thus, if C is any category with finite limits, a 
ring-object in C (or briefly: a ring in C) is an object R of C equipped 
with morphisms 


13 RES RXR (1) 


in C for which the usual identities for a commutative ring with unit 
(expressed by diagrams in C) hold. For example, a ring in Sets is an 
ordinary ring, while a ring in the category Sh(X) of sheaves on a space 
X is the same thing as a sheaf of rings on X (cf. §II.7). With the evident 
notion of morphism, this defines a category 


Ring(C) (2) 


of rings in C, while each left exact functor F: C — C'’ be 
tween categories with finite limits induces a functor Ring(C) — 
Ring(C’). Moreover, a natural transformation between such functors 
Cac yields a natural transformation between the induced functors 
Ring(C) > Ring(C’). In this sense, the category Ring(C) is a functor 
of C. 

In particular, for any topos € there is a category Ring(€) of rings in €, 
and for any geometric morphism f: F — € between topoi € and F, the 
(left exact) inverse image functor f* induces a functor f*: Ring(€) — 
Ring(F). We will now show that there exists a classifying topos for 
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rings; that is, a topos R with a ring object R in R, such that for each 
(cocomplete) topos € there is an equivalence of categories 


Hom(€,R) —*—> Ring(€), (3) 


natural in €. To prove the existence of such a classifying topos R with 
a universal ring R, we will proceed much as in the previous section by 
first constructing a suitable free category and then applying the results 
of Chapter VII. 

A category A with finite limits with a ring object A is said to be 
freely generated for rings by A if, for any other category C with finite 
limits, the evaluation of left exact functors at A induces an equivalence 
of categories 

Lex(A, C) —— Ring(C). (4) 


(Here, as before, Lex denotes the category of left exact functors and 
natural transformations between them; each such left exact functor takes 
ring objects to ring objects.) If A is freely generated by the ring A in 
this sense, then for any cocomplete topos € there are equivalences of 
categories, 


Hom(€, Sets*”) ~ Lex(.A, €) ~ Ring(€), (5) 


both natural in €. The second equivalence is that of (4), while the 
first equivalence is a special case of Corollary VII.9.4. Thus, Sets*” 
is a classifying topos for rings. Under the equivalences (5), a geometric 
morphism f: E > Sets*” corresponds first to the left exact functor 
froy: A Sets“” — €, and then [by the equivalence (4)]| to the ring- 
object f*(y(A)) in €. Thus the “universal” ring-object, which is the ring 
in Sets*” obtained by taking f to be the identity, is the representable 
ring-object y(A) € Sets*” corresponding to the ring A which freely 
generates the category A. 

In order to produce a classifying topos for rings, it thus suffices to 
describe a category with finite limits A freely generated for rings by a 
ring A. We will show that the category 


A = (fp-rings)°?, (6) 


opposite to the familiar category of all finitely presented rings, is such 
a category with finite limits freely generated by a ring object A. More- 
over, A will be the ordinary polynomial ring Z[X]! This is essentially a 
formulation of standard properties of fp-rings. 

Recall that a ring is finitely presented (over the ring of integers Z), 
if it is isomorphic to a ring of the form 


Z[X1,...,Xnl/(Pry--»,Pe)s (7) 
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where the P; are polynomials in the indeterminates X,,...,Xn. This 
category of finitely presented rings and all ring homomorphisms be- 
tween them has finite colimits: it has an initial object Z, while the 
coproduct of a finitely presented ring as in (7) and another such 
Z[Yi,---,¥ml/(Qi,--.,Qe) is their tensor product, again finitely pre- 
sented since 


Z[X1,...,Xn]/(Pr,--+, Pr) @Z[Mi, .--, Yn/(Qr,---s Qe) & 
Zl Xi aks Xs Viv snes Val (Pyeng Ov): 


Coequalizers also exist: a map a from Z[X,...,Xn]/(Pi,..., Pe) into 
Z[Yi,.-+; ¥m]/(Qi,---, Qe) is given by an n-tuple (a),...,@n) of poly- 
nomials in the indeterminates Y1,..., Ym, where a; = a(X;) and each 
P;(04,...,0n) = 0 (mod Qi,...,@e), while the coequalizer of any one 
such homomorphism a and a second one # = ((),..., @n) is the finitely 
presented quotient ring 


Z[Yi,---5 YmJ/(Qi,---, Qe, ~ Biy.-+y)Qn — Bn). 


To summarize: The category (fp-rings) has all finite colimits (with 
tensor products as coproducts and quotient rings as coequalizers) and 
all the objects of the category are constructed using such colimits from 
the one polynomial ring Z[X]. We will observe below that the category 
(fp-rings) is indeed freely generated by these operations on the ring 
Z[X]. This we regard as a categorical formulation of the basic role of 
polynomial rings in algebra. 

The opposite category A = (fp-rings)°? is therefore a category 
with finite limits. Furthermore, the object A = Z[X] is a ring-object 


in this category A: the arrows 0, 1: 1s AEA x A in A giving 


+ 
A a ring-structure are the following arrows in the opposite category 


A°P = (fp-rings): 
Z = 2[X] 3 2X, Y] = ZX] @2(X], (8) 


where X + Y and X -Y denote the unique homomorphisms sending the 
element X € Z[X] to X + Y and X -Y respectively, while similarly 0 
and 1: Z[X] > Zsend X to 0 € Z and 1 € Z, respectively. 


Proposition 1. The category A = (fp-rings)°? is a category with 
finite limits freely generated by the ring-object A = Z[X]. 


(Without using opposite categories, this proposition can also be 
phrased thus: the category of finitely presented rings is a category with 
finite colimits, freely generated by the “coring” Z[X].) 
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As explained above, this proposition asserts that for any category 
with finite limits C, the evaluation of a left exact functor F at Z[X] 
gives the following equivalence of categories 


evz{x]: Lex((fp-rings)°?, C) — Ring(C), (9 

Fw F(Z[X}). ) 
(This correspondence is a well-defined functor, for if F' is left exact, 
then it preserves ring-objects, hence sends the ring-object Z[X] of 
(fp-rings)°? into a ring-object of C.) 

To avoid possible confusion between the category (fp-rings) and its 
opposite, let us agree that any arrow or diagram written below is in the 
category (fp-rings); thus functors (fp-rings)°? — C will accordingly 
be viewed as contravariant functors from (fp-rings) to C. 

The proof of Proposition 1 will explicitly describe a quasi-inverse 
functor for the evaluation functor (9), to be denoted 


: Ring(C) — Lex((fp-rings)°?, C), 


10 
a (10) 
For a given ring R in C, the left exact functor ¢g: (fp-rings)°? — C 
is defined in the unavoidable way: Since ¢ is to be an inverse to the 
evaluation (9), we set 

or(Z[X]) = RB. (11) 


Moreover, since gr is to be left exact, it transforms coproducts in 
(fp-rings) (i.e., tensor products) into products in C. Hence, since the 
polynomial ring Z[X1,...,Xn] = Z[X,|]@®...® Z[X,] is the n-fold ten- 
sor product of copies of Z[X], we set 


ér(Z[X1,...,Xnl) = R". (12) 


An arrow P: Z[Y,,...,¥%] — Z[Xi,..-,Xn] in (fp-rings) is given by 
a k-tuple of polynomials P;(X),...,Xn), where i = 1,...,k; here P; is 
the image of Y; under P. Each such polynomial P,;(X),...,Xn) yields 
an arrow in C, 

PS®). R” 5 R, (13) 


defined from the ring structure of R by the familiar process of substi- 
tuting “elements” of R for the indeterminates X in the polynomial P;. 

For example, the polynomial X, - X_ + X3 +2 yields the following 
map (X)X2 + X3 + 2)(*): R3 — R, given as the composite 


Rx Rx RZ Rx Rt>Re1xRZSRx R+6R, 
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(4,1 
where 2: 1 — R is the composite 1 ai, RxR en R. 
The image under ¢p of the arrow P: Z[Y,,..., Ye] > ZX ..., Xn] 
as above is the map 


ér(P) = P® = (P®,..., P&™): RS RF. (14) 


To complete the definition of the functor ¢ on any finitely presented 
ring B, choose an isomorphism (i.e., a presentation) 


Op: BZ[X,...,Xnl/(Pi,---, Pr) 


for suitable polynomials P;. This quotient ring, by its definition, fits 
into a coequalizer diagram 


P 
Z(Y,-.-5 Ye] =3 Z[X1,...,Xn] —» Z[X,..-,Xn]/(Pi,---, Pe), (15) 
where P(Y;) = P, and 0(Y;) = 0. Hence we define the image under the 
contravariant functor dr of this quotient ring by the following equalizer 
diagram in C: 
n P® . ok 
ba(Z[Xiy.--)Xnl/(Piys--s Fe) ——> R” BRE, (16) 


Next, we define ¢r on a homomorphism h: B >= C 


B - C 


Oz (I? Jom its 


Z[X1,.--;Xn]/(Piy--+, Pk) ——> 2[M1,...,Win|/(Lis +, Lt) 


between finitely presented rings. Here the L’s are polynomials in the 
indeterminates W. Thus h is determined by the n elements 0ch05' (X;). 
Each is the equivalence class of a polynomial H; in the indeterminates 
W, where, for each 7, 


P, (Bh 20's Hy) © (Dig es). (17) 


As in (14), these n polynomials determine a homomorphism 
H‘®): R™ — R”. Now ¢r(B) and ¢r(C) fit into equalizer rows 


ér(B)———> R RE 
oe o® 
a 
(R) 
jC) eS Bert 
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The equalizer a lower left is determined by m arrows as: ¢r(C) - R 
which satisfy Lg(a,,...,@m) = 0. Hence by (17) above the composite 
H®®) oa consists of n arrows to R which satisfy the conditions P = 0. 
Therefore, by the definition of equalizers, there is a unique arrow @r(h) 
as indicated on the left above, it is independent of the choice of the H; 
in their equivalence classes, and makes @p a functor, as required in (16). 

Note that the construction uses a choice of isomorphisms 03; this 
amounts (here and below) to observing that the category fp-rings is 
equivalent to the category of all such explicit presentations. 

We prove below that, for each ring R in C, the functor dr thus 
defined is indeed a left exact functor (fp-rings)°? — C. Taking this for 
granted for the moment, we observe that ¢ is indeed a quasi-inverse for 
the evaluation functor (9). Indeed, one way around, each ring R in C 
may be written as 


(evzix] °¢)(R) = dr(Z[X]) = R 


[see (11)]; so evzjx)0¢ is the identity functor on the category of 
rings in C. The other way around, consider any left exact functor 
F: (fp-rings)°? — C. We wish to construct an isomorphism 


=TR: $0 evaix)(F) —- F, 


natural in F. Write R = evzjx)(F) = F(Z[X]). Then, since 
F is left exact, it transforms an n-fold coproduct Z[X1,...,Xn| = 
Z[X|®...® Z[X] of finitely presented rings into an n-fold product in 
C; so that there is for each n > 1 an isomorphism 


T=: R” —Y—> F(Z[X]) ®...@F(Z[X]) = F(Z[X,...,Xn]) (18) 


(and these isomorphisms can be chosen to commute with the iso- 
morphisms R" x R™ = R"+™ and Z[X),...,Xn]@Z[X,..., Xm] = 
Z[X1,---,Xn4m]). Now, by the definition of the evaluation functor (9), 
the ring-structure of R in C is the image under F' of the ring-structure 
of Z[X] in (fp-rings)°? in the sense that all squares in the following 
diagram commute: 


a R 
To |= 1; =/72 
F(1) F(X1+X2) (19) 
, Z F(Z[Xy, Xel), 
F(Z] St PAD Fae PA Xa) 


Z a Z[X] crt Z[X1, Xe]. 
1°42 
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But these horizontal arrows generate all the ring operations. Each such 
operation is given by a polynomial P. It follows that for each polynomial 
P(X),...,Xpn), the arrow P‘®); R" — R obtained by “substitution” in 
P corresponds via 7 to the image under F' of the arrow P: Z[Y] > 
Z[X1,..., Xn] with (Y > P(X,,...,Xn)), so that the square 


pcR) 


F(2[X1,-.-,Xnl) a> FAY), 


Z[X15.+2,Xy) —"—~ ZY], 


commutes. 
The construction (18) of r on R” now extends easily to a natural 


isomorphism 7: ¢r nee Indeed, for a finitely presented ring of the 
form Z[X),...,Xn]/(Pi,---, Px), the contravariant functor F' sends the 
coequalizer (15) into an equalizer in C, which we can compare to the 
equalizer (16) for the special case where R = F(Z[X]), via the isomor- 
phism 7 as in (18); this yields in C the following diagram (solid arrows): 


F(Z[Xy,..., Xl (Pt,-<-4 Pe) F(Z[Xi,..., Xn]) ad F(Z... Ye 


, F(0) 
{ 
7A +] > 
; P(r) 
br(Z[X1,...,Xn|/(Pi,-.., Px) R°® = BF, 
o® (21) 
where the rows are equalizers. Both squares on the right commute, by 
the universal property of the product R* and the commutativity of (20). 
Therefore, by the universal property of the equalizer rows in (21), there 
is a unique dotted isomorphism 7, as indicated on the left of (21). 

This proves for each left exact functor F that there is a natural 
isomorphism 7: ¢r = ¢ © evzix)(F) —> F. The construction is natural 
in F, 

To complete the proof of Proposition 1, it remains to show for any 
ring R in C that the functor dr: (fp-rings)°? — C, as defined above, 
is left exact. 

Now ¢r(Z) is the empty product of copies of R [cf. (12) for n = 0); 
ie., dr(Z) = 1, so dr preserves the terminal object. 

Also, since the product of two equalizer diagrams is again an equal- 
izer (“interchange of limits”), one easily verifies from (16) that opr pre- 
serves binary products. Finally, to see that @p preserves equalizers, 
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consider a coequalizer constructed in the evident way from two maps S$ 
and S" in the category of finitely presented rings, 


ZIV, --->¥ml/(Q) 3 ZX1,.+.;Xn]/(P) + BX, ..-, Xa (PF ~- 3} 
here Q = (Qi,..-,Qe) and P = (F,,..., Px), while the maps S and 
S’ send Y; to polynomials $;(X),...,Xn) and S{(X1,...,Xn) for i = 
1,...,m (which satisfy suitable conditions related to P and Q). We have 
to show that ¢z sends this coequalizer (22) to an equalizer diagram in 
C. First of all, if (22) is a coequalizer, then so is the diagram 


Z1Y1,...,Yml = Z[X1,...,Xal/(P) + Z[X,..., Xn] /(P, 9 — 8’), 


(23) 
obtained by precomposing (22) with the epimorphism 

Z[Yi,-.-; Ym] Z[Y1,..-, Ym]/(Q). Moreover, since dz sends the latter 
epimorphism to a monomorphism in C [in fact to an equalizer, as in 
(16)], @x sends (22) to an equalizer iff it does so for (23). So it suffices 
to show that @r sends coequalizers of the special form (23) to equalizers 
in C. Next, since (23) is a coequalizer, so is 


ZIVi,.- <> Yon) <= Z[X1,..-, Xa]/(P) » Z[Xy,..., Xal/(P, 5 — 8’), 
: (24 


and one readily checks that dr sends (23) to an equalizer in C iff it does 
so for (24). So by replacing S by S — S’ and S’ by 0 in (23), we see 
that it suffices to show that ¢r sends coequalizers of the form (23) with 
S’ = 0 to equalizers in C. So, from the polynomials S and P and k new 
indeterminates W,,...,W, construct the diagram 


Z[W,,..., We 


oO; 1P 


S,P 
ZIYi,..-,Yins Way. Wal a8 Oe —» Z[X),...,Xal/(S,P 


ie | | 


ZY,...,Yenl a BX sake Xn (PY ZX yeiaeg kal (Ge EP 


consisting of three coequalizers, two of the form (15). By Definition (16), 
or sends both the vertical coequalizer and the upper horizontal coequal- 
izer to equalizers in C. It follows by diagram chasing that it also sends 
the lower horizontal coequalizer to an equalizer in C. 

This shows that gr: (fp-rings)°? — C is a left exact functor and 
completes the proof of Proposition 1. 

As explained before the statement of Proposition 1, it follows that the 
presheaf topos Sets“P-T™*) is a classifying topos for rings, in which the 
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universal ring-object R in Sets‘fP-""€*) is the ring y(Z[X]) represented 
by Z[X]. But notice that a ring object in Sets“P-"™S*) is nothing but a 
functor from (fp-rings) into the category of rings; and that since Z[X] 
is the free ring on one generator X, the universal ring R = y(Z[X]) = 
Hom(Z[X], —) is simply the inclusion functor from (fp-rings) to the 
category of rings. We can therefore summarize the result of this section 
as follows: 


Theorem 2. The presheaf topos Sets“P-""8*) is a classifying topos 
for rings, and the universal ring R is the ring-object in Sets¢P-™"8*) 
given by the inclusion functor from fp-rings into rings. Thus, for any 
cocomplete topos € there is an equivalence of categories, natural in €: 


Hom(E, SetsP-Ti"8*)) —, Ring(E), 
fr f"(R). 


6. The Zariski Topos Classifies Local Rings 


In this section we will show that the Zariski topos (over the ring of 
integers Z) is a classifying topos for local rings. 

Before proving this result, we should first explain what we mean by 
a local ring object in a topos. Recall that a ring R in Sets is called a 
local ring if it has a unique maximal ideal. This condition is equivalent 
to the condition that for any element a of the ring R, either a or 1— a 
is invertible; that is, 


Va € R(abE R(a-b=1)V 3bE R(1—a) -b=1). (1) 


(For this equivalence, which uses the axiom of choice, see any book 
on commutative algebra, or regard it as an exercise.) The Mitchell- 
Bénabou language, introduced in Chapter VI, enables us to define the 
notion of a local ring object in a topos E: it is a ring-object R in € such 
that the formula (1) of the Mitchell-Bénabou language is valid in €. By 
definition of validity, this means that the union of the subobjects 


{aER|Ab(a-b=1)}R, 
{a €R|3b((1—a)-b=1)}4R 


of Ris all of R. Equivalently, consider the two subobjects of the product 
Rx R defined by 


U={(a,b)e Rx Rla-b=1} RXR, \ 


V ={(a,b) ERxR|(1-a)-b=1}5RX RK; (2) 
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then # is a local ring iff the two composites UR x R , Rand 


VoORxXR—>R form an epimorphic family in €. We recall that the 
objects U and V are constructed as the pullbacks 


re | ——— ons | 
a | b 
RxR—5R, Rx R=! R x F— SR, 


where 7: R — Ris the composition R = 1xR Saiki) RxR — R,ie., the 
map corresponding to the polynomial 1~ X. This gives a description of 
local ring-objects R in a topos € which is motivated by (but circumvents) 
the Mitchell-Bénabou language: The ring-object R in € is local iff the 
two projections 7,: U — R and ™: V — R for U and V as described 
in (3) yield an epimorphism U+V — R. 

By way of an example, we consider the case of the topos of sheaves 
on a space X: 


Proposition 1. For any topological space X, a sheaf R of rings is 
a local ring in Sh(X) iff at each point x € X the stalk R; is a local ring 
(in Sets). 


Proof: For any point z € X, the stalk functor Sh(X) — Sets, 
which sends each sheaf F to its stalk F, at x, commutes with colimits 
and with finite limits. [In fact, it is the inverse image of the geometric 
morphism Sets = Sh(1) — Sh(X) induced by the map of spaces X: 1 — 
X, see VII, §1.] In particular, the stalk functor (—), preserves the 
construction of the sheaves U and V defined in Sh(X) by the pullbacks 
(3), so that 


U, = {(a,6) € R,|a-b=1}, (4) 
V, = {(a,b) € R, | (1—a)-b=1}. 

Now R is a local ring in Sh(X) iff (m,72): U+V — R is an epi of 
sheaves. But a map of sheaves is epi iff it gives a surjective map of 
stalks at each point z € X (Proposition I1.6.6). Hence, since the stalk- 
functor (~— ), commutes with sums, (7,72): U+V — R is epi in Sh(X) 
iff for each « € X the map (m,72)z: Ur + Ve — Re is a surjection of 
sets. By (4), this is the case precisely when R,z is a local ring for each 
point x € X. 

Thus, the sheaf of germs of smooth functions on a manifold is a local 


ring-object, because the ring of germs at each point is a local ring. (In 
fact, this is the motivation for the term “local” .) 
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In the previous section, we have observed that there is an equiv- 
alence between ring-objects R in a topos € and left exact functors 
(fp-rings)°? — €. Explicitly, given such a left exact functor F, the 
corresponding ring-object R in € is F(Z[X]). This is indeed a ring in €, 
since F is left exact and Z[X], as in §5(8), is a ring in (fp-rings)°?, ie., a 
co-ring in (fp-rings). Conversely, given a ring R in €, the corresponding 
functor 

or: (fp-rings)°? — € 


sends the fp-ring A = Z[X),..., Xn]/(Pi,..., Pe) to the following equal- 
izer in E: (P 
Lys oP, 

ér(A)————> R" i (5) 
This description {as in (16) of §5] readily yields the corresponding def- 
inition of dr on arrows. The following lemma gives a condition for a 
ring Rf in a topos € to be local, phrased in terms of this corresponding 
functor @p. 


Lemma 2. Let E be a topos, let R be a ring in €, and let 
or: (fp-rings)°? — € be the corresponding left exact functor. The 
following are equivalent: 


(i) R is a local ring in €; 
(ii) dp sends the pair of arrows in the category (fp-rings), 


Z[X, Y]/(XY - Y +1)— Z[X] — Z[X,Y|/(X-Y-1), 


to an epimorphic family (of two arrows) in E; 


(iii) for any finitely presented ring A and any elements a,,...,an € A 
such that a, +---+@n = 1, dr sends the family of arrows in 
(fp-rings) 


{A-— Alay") |i=1,...,n} 


to an epimorphic family {¢r(A) — dr(Ala;*]) | ¢ = 1,...,n} 
in E. 


Proof: (i)<(ii) This follows immediately from the explicit descrip- 
tion of the functor ¢r; to wit, dp by (5) sends the ring Z[X, Y]/(XY—1) 
to the equalizer U of (3). By §5(17) it thus sends the second arrow 
Z[X] > Z[X, Y]/(X-Y — 1) of (ii) into the composite U — Rx R—» R 
with U defined as in (3), and the arrow Z[X] — Z[X, ¥]/(X-Y~¥ +1) = 
Z[X,Y]/(1 — X)-Y—1) into V>RxR —, R, again as in (3). So, 
by the definition of a local ring, (i) is equivalent to (ii). 

(iii)=>(ii) is also clear since (ii) is the special case of (iii) in which 
A=Z([X] while n = 2, a, = X, a2 =1-X. 
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(ii) (iii) Assume that (ii) holds, and suppose given a finitely pre- 
sented ring A and elements a),...,a¢, € A with Sa; = 1. Consider 
first the case n = 2, so that ag = 1 — a;. Form the pushouts of finitely 
presented rings along the map Z[X] — A sending X to aj, as in 


Z[X, Y]/((1 — X) -¥ — 1) —<—— Z[xX] ——=> Z[X, Y]/(X -Y -1) 


| ‘ | 


A[(1 — a1)7*] 


A————> Alay", 
giving the indicated ring of quotients A[(1 — a,)~1] or A[aj’]. These 
squares are pullbacks in (fp-rings)°?, hence they are sent by the left- 
exact functor dr to pullbacks in €, as in 

V 7 R«————__ U 


| | | 


$r(Al[(1 — a1)~*]) —— $r(A) —— ¢r(Alay') 


But by assumption U — R and V — R form an epimorphic family in €, 
and hence so does the pullback of this family. This proves (iii) for the 
case n = 2. 

The general case follows by induction on n. For instance, if n = 3, 
we are given three elements a), a2, a3 such that a, + a2 +a3 = 1 in the 
ring A. Then, by the case n = 2 just proved, dp sends (the duals of) 
A > Alay] and A > A[(a2 +.a3)~"] into an epimorphic family in €. 
Let @2 and @3 denote the images of az and a3 under A — A[(a2+a3)7]. 
Then the inverse b € A[(a2 + a3)~1] gives (b- G2) + (b- G3) = 1. So by 
the case n = 2 again, the following two arrows in 


Al(az + a3)~*] > A[(ag + a3)~*][(6 “h 
A[(a2 + @3)~*] > A[(a2 + a3) (6: 


& 
in 


are sent to an epimorphic family in € by the functor gr. Thus since the 
composition of epimorphic families is epimorphic, we conclude that ¢r 
sends the three arrows in (fp-rings) 


A- Alay", 
A > Al(az + a3) *|[(b@2)~"], (6) 
A — A[(az + a3)~1][(ba3)7"] 


to an epimorphic family in €. Since A > Alag 1} and A > Ala3*| 
respectively factor through the second and the third arrows in (6), it 
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follows that {A —> Alaz"] | i = 1,2,3} is also sent to an epimorphic 
family in €. Then induction in this way proves the lemma. 


Now recall from §3(4) of Chapter III that the duals of the cover- 
ing families in condition (iii) of this lemma form a Grothendieck topol- 
ogy J on the category (fp-rings)°?. The associated topos of sheaves 
Sh((fp-rings)°?, J) is the Zariski topos Z (over the ground ring k = Z). 
The results of Chapter VII will now yield the following theorem: 


Theorem 3. The Zariski topos Z (over the integers Z) is a classi- 
fying topos for local rings; i.e., for any cocomplete topos E there is an 
equivalence of categories 


Hom(€, Z) ——- LocRing(€), (7) 


where LocRing(€) is the category of local rings in €. The universal local 
ring is the structure sheaf O of the Zariski topos (cf. §III.4). 


Proof: As a special case of Corollary VII.9.4, there is an equiv- 
alence between Hom(£,Z) and the category of continuous left exact 
functors (fp-rings)°? — €. By the results of the previous section to- 
gether with Lemma 2, this category in turn is equivalent to the full 
subcategory of Ring(&) consisting of local rings. This proves the equiv- 
alence (7). The identification of the universal local ring proceeds as 
before: it is the object of the topos Z represented by the object Z[X] of 
the site (fp-rings)°?. Since Z[X] is simply the underlying-set functor 
(fp-rings) — Sets, this is exactly the structure sheaf O discussed in 
Chapter IIT. 


Notice that the Zariski topos Z is a subtopos of the classifying 
topos SetsP-!"85) constructed in Theorem 5.2. The “universal mod- 
els” occurring in Theorem 3 above and in Theorem 5.2 are really 
the same object viewed in different topoi: the underlying-set functor 
R: (fp-rings) — Sets is an object of Sets“?-™8*) but also of the 
Zariski topos Z since it is a sheaf for the Grothendieck topology on 
(fp-rings)°? which is used to define the topos Z. But notice that, as an 
object of Sets“P-!€*), the functor R is not a local ring (of course, the 
universal ring is not local). Theorem 3 expresses the fact that Z is the 
largest subtopos of SetsP-Tines) which turns R into a local ring; or in 
other words, that the Grothendieck topology J defining the topos Z is 
the smallest topology which forces R to be a local ring. At first thought, 
it is perhaps surprising that this “universal” way of making the ring R 
in the topos SetsP-T!8*) into a local ring does not involve changing R, 
but changing the topos in which it lives! 
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7. Simplicial Sets 


In algebraic topology the homology of a space X is often calculated 
by triangulating the space, forming “chains” as linear combinations of 
the resulting simplices (the “triangles” ) and using the boundary of these 
simplices to define a boundary for each chain and so to supply the homol- 
ogy groups of the space X. More generally, one may replace the simplices 
of a triangulation of X by all the “singular” simplices of X; that is, all 
the continuous maps T: A” — X of the standard n-simplex A” into 
X. It is essential that the vertices of A” are ordered: 0<1<.---<n. 
Then the sets 

Sn(X) ={T: A® = X} (1) 


for all n with suitable maps between them form a typical example of a 
simplicial set. We recall the definition from §I.1(10). 

The simplicial category A is the category with objects all finite 
nonempty ordered sets of the form 


[n] = {0,1,2,...,n}, n>0, (2) 


and with morphisms a: [n] — [m] all (weakly) increasing functions a, 
so that 0 <i < j <n implies a(i) < a(j). One also says that such an a 
is monotonic. 

This category has a geometric interpretation, as follows. Choose for 
each n a “standard” affine n-simplex A”, with its n + 1 vertices, say 
Up,--+,Un, in linear order. Recall that an affine map is defined to be one 
which preserves weighted averages, while every point of A” is a suitable 
weighted average of its vertices. Hence each morphism a: [n] > [ml] of 
A determines a unique affine map A” — A™; namely, that sending each 
vertex v; of A” to the vertex vg, of A™. This defines a canonical functor 


A®: A — (Spaces), [nJ ro A”, (3) 


and indicates that the simplicial category A is (isomorphic to) the cat- 
egory of standard affine simplices and affine maps. 
Among the affine maps, the i** face map 


eo Al A, 4=0).2.,7, (4a) 


is that increasing monomorphism e,: [n — 1] — [n] which omits (only) 
the vertex i of A”. The i*> degeneracy 


ge Aer Ss eB", i=0,...,n, (4b) 


is that increasing epimorphism [n +1] — [n] which collapses (only) i+1 
to i. One may readily show [CWM, pp. 172-173] that every increasing 
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monomorphism is a composite of e’s and every increasing epimorphism 

is a composite of o’s. Hence every increasing a is a composite of ¢’s and 

o’s. Thus A may be described in terms of “faces” and “degeneracies”. 
A simplicial set S is a (contravariant) functor 


S: A°P — Sets (5) 


on the category A. One usually writes S,, for S[n] and then a*: S,, > 
Sn for the action on S of a morphism a: [n] — [m] of A. This includes in 
particular the “face” operators d; = ¢* and the “degeneracies” s; = oj, 


di: Sn — Sn-1; 8): Sn > Spray, 4=0,...,n. (6) 


They satisfy exactly the identities already listed in §1.1(16). Indeed, a 
simplicial set S can also be described (as in 81.1) as a family of sets 
S, with operators d; and s; as in (6) which satisfy the listed identities 
(II.1.(7)] [CW™M, pp. 172-173], [Mac Lane, 1963, pp. 233-234], and 
[May, pp. 1-4]. One often visualizes a simplicial set S' by the diagrams 
of face and degeneracy operators, 


So = S, FE S..., So —> S; = So... . (7) 


The category (Ssets) of all such functors S is the category of simplicial 
sets 
Ssets = Sets”. (8) 


There are non-geometric examples. For instance, every category C 
determines a simplicial set C,, called its nerve, where each 


Crh = C1 Xe, Ci Xeq «+ XCy C1 (n factors) 


is the set of composable strings of n arrows. The face maps are given 
as suitable compositions, while the degeneracies insert identity arrows 
(suggestion: complete this definition). 

Every natural number m gives a representable functor y[m] = 
Homa, (—, [m]) which is of course a simplicial set. We will need to use 
the case m = 1, to be written 


V = Homa (-, [1]). (9) 


We outline the role of simplicial sets in homology. The elements of 
S, are regarded as “n-cells”; an n-chain is a finite linear combination 
of elements s; € S,, with integral coefficients; these n-chains form an 
abelian group C,. The alternating sum of the induced face operators 
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d;: Cy — Cp, is the “boundary” homomorphism (take the faces in 
order, with alternating signs) 


n 


O=SoCVidii Ca Can, n= ],.... (10) 


i=0 


This is an algebraic translation of the generalized “boundary” of a chain. 
By the identities for dj, one has 0@ = 0. Then the usual quotient 
“cycles modulo boundaries” defines the n> integral homology group of 
the simplicial set S as 


H,(S,Z) = Ker(8: Cp > Cn—1)/Im(O: Casi Cn). (11) 


For a topological space X, the simplicial set S(X) = S(X) defined in (1) 
is the standard “singular complex” of X; it gives by (11) the classical 
“singular” homology groups of the topological spaces X. In this section, 
we write S(X) to distinguish this from other simplicial sets. An essential 
point of the approach is the use of simplices A” with ordered vertices; 
for example, this choice avoids earlier difficulties with the orientation of 
cells. This idea will reappear in the next section, where the category 
Ssets turns out to be the classifying topos for linear orders! 


The simplicial set S(X) also serves to define homology and coho- 
mology groups of the space X with “coefficients” in any abelian group. 
Simplicial formulas also yield the cup product (the cohomology rings) 
and the Eilenberg-Zilber theorem for the homology of a product of spaces 
(cf. [Massey], [Mac Lane, 1963], etc.). One may also use S(X) to con- 
struct the homotopy groups and the “homotopy type” of X. Extensive 
use of face and degeneracy operators was required in calculating ho- 
mology and cohomology of the Eilenberg-Mac Lane classifying spaces 
K(x,n). Simplicial constructions are currently extensively used in K- 
theory and for simplicial sheaves. 


The standard n-simplex A” can be described by barycentric coor- 
dinates of its points or—more conveniently for our purposes—as the 
following subset of the unit n-dimensional cube I” with coordinates ¢,: 


A” ={(t),...,tn) |O<ti <---St<1fCl”. (12) 


Thus, A° is a point, A! is the unit interval I of the reals, A? is the 
“ypper” triangle with vertices (0,0), (0,1), and (1,1) in the square I?, 
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and so on, as in 


(0, 0,1) 


Thus, the i*" vertex v; of A” is the point v; = (0,0,...,1,...,1) with 
the first n — 7 coordinates zero. 

The linear order (12) will be used in the next section to construct 
classifying topoi for orders. To do this we need to describe more ex- 
plicitly the effect of the “standard simplex” functor A® of (3) on ar- 
rows. For this, let us write Hom4([m], [n]) for the set of those arrows 
u: [m] — [n] in A which preserve both bottom and top elements [so 
u(0) = 0; u(m) = nj. There is a bijection 


Homa ({n], [m]) ¥ Hom ((rn + 1], fn +1), (13) 
ar Ga, U4, (14) 


where an arrow a: [n] — [m] on the left of (13) corresponds to an arrow 
u on the right iff, for allO <i<nandallO<j<m+l, 


j<sa(i) iff uj) <i. (15) 


This states that the bijection (15) is an adjunction between posets— 
more specifically, that it is a so-called Galois connection ([CWM, p. 93)). 
This bijection may be stated more explicitly. The “if” of (15) may be 
written as 

u” (i) = a(é) = max{ j | u() <i}. (16a) 


There always is at least one such j, to wit, 7 = 0. The “only if” becomes 


min{i|j<a(i)}_ if i exists, 


16b 
n+l otherwise. ( ) 


a) = u(s) = { 


These suffice to define a = u~ from u or u = & from a, establishing 
the bijection. 
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The functor A*: A — (Spaces) then sends each arrow a: [n] > [m] 
of A to that map A®: A” — A™ of standard simplices which takes a 
point ¢ = (t1,...,tn) € A” to the point A%(t) = (A%(),,.. -, A(t) n) 
of A™ with j** coordinate, for 7 = 1,...,m, described as follows: 
0 if a(j) = 0, 
A*(),=< 1 if a(j) =n+1, (17) 
t03) otherwise. 
In particular, one may verify that this A® does take the i*” vertex 
(0,0,...,1,...,1) with n — i zeros in A” into the a(i)"" vertex of A”, 
it thus has the intended effect upon vertices. 
For a given space X the singular complex S(X) € (Ssets) as defined 
above may now be described by applying the Hom-functor to the map 
A?*: A — (Spaces); in other words, 


S(X) = Hom(A’, X). (18) 


On the other hand, each simplicial set S has a “geometric realization”, a 
space |S|. The intent is that each s € Si, is to be realized by a copy A” 
of the standard m-simplex, with its points labelled as (s,t), for t ¢ A”. 
Moreover, the realization Ar of the it face of s is to be “pasted” to 
A™ by the i** face map e;: A™~1 — A™. Much the same is to apply 
to degeneracies. Both cases can be combined as follows. Given any 
increasing map 


a: [n] > [m], A*; A? = A™, S(a): Sm 2S, (19) 
and any s € S,, the copy AS (a)s is to be pasted to A™ by the continuous 


map A@. Thus, take S,, x A™, the disjoint union of copies of A™ indexed 
by S,,, generate there an equivalence relation ~ by setting 


(S(a)s,t) ~ (s, A*(é)), te A”, 


for any s € Sj, and any a as in (19), and define the geometric realization 
as the quotient space (in an appropriate category) 


IS} =([] Sm x A™)/~. 


m>0 


Now this amounts exactly to defining the realization |S| as the following 
coequalizer of spaces 


I] 4° J] 4*7—!5l= Sens, 
"eb 25m 


(20) 
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with the familiar maps 9 and 7. But this is a “tensor product”, just like 
the tensor product of functors defined by coequalizers in Chapter VII 
[see for example VII.2 (11)], except that the present functors take spaces 
and not sets as values. “Cut and paste” becomes a tensor! 

The usual Hom-tensor adjunction also applies to this context; so 
“singular complex” and “geometric realization” form an adjoint pair of 
functors 

| —|: (Ssets) === (Spaces) :S (21) 


with geometric realization | —| left adjoint to S. Moreover, if the geo- 
metric realization functor S + |S| takes values in a category of spaces 
with good exactness properties (say in the category of compactly gener- 
ated spaces), then this realization functor can be shown to be left exact 
(see, e.g., (Gabriel, Zisman, p. 49-52] or [May, 1972, p. 57] for de- 
tails). Thus, if we interpret “spaces” as compactly generated spaces, the 
adjoint pair of functors (21) has the formal properties of a geometric 
morphism, except that the category of spaces is of course not a topos. 


8. Simplicial Sets Classify Linear Orders 


To explain how the category of simplicial sets is a classifying topos, 
we will now describe a topos-theoretic variant of “geometric realization” . 
This will apply to “linear orders” I with “bottom and top”—or “orders” 
for short. In the category Sets such an order, of course, is to be a set I 
together with a binary relation R C I x I which is a linear order with 
a smallest element 6 (for “bottom”) and a largest element t (for “top”) 
where b # t; we may write I = (J, R,b,t) for such an order. Then a 
morphism of orders in Sets is defined to be a function which preserves 
the linear order as well as the bottom and top elements. This defines a 
category (Orders) with objects such linear orders in Sets. 

More generally, in any topos €, one can define an “order” in € to be 
an object I of €, with a subobject RI x I and two global elements 
b, t: 1=3T, all such that the sentence of the Mitchell~Bénabou language 
which states that R is a linear order on I with bottom 6} and top ¢ is 
valid in €. Explicitly, (J, R,b,¢) is an order in the topos €, when the 
following sentences are valid in €, where “x < y” stands for (x, y) € R: 

(i) Ve EI (a <2), 

(ii) Va,y,z ET (xesyAysz>u <2), 

(iii) Vaz,y eT (a2<yAysr>r=y), 

(iv) Ve eI (b<a2Az St), 

(v) -(@ = 2), 

(vi) Vz,yEel(e@<yVy <2). 
An equivalent definition of an order I = (I, R,b,t) in a topos € may be 
given, by spelling out what it means for (i)-(vi) to be valid in €, say by 
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using Kripke—Joyal semantics. To do this, notice first that the subobject 
RI x I defines for each object E of the topos € a binary relation < 
on the set Home(E, I) of arrows E — I: for two such arrows f and g, 
say that 


f<g iff (f,g): B-I x I factors through R. (1) 


Also, by composing with the unique map FE — 1, the arrows b, t: 1 > J 
yield arrows bp, tg: E — I. The conditions (i)-(vi) above on (J, R,},t) 
are now equivalent to the following conditions on arrows f, 9g, h: E — I, 
for each object E € E: 


Gy Fey 
(ii’) if f <g and g <h, then f <h; 
(iii’) if f <g and g< f, then f = g; 
liv’) be <f and f < tp; 
(v’) bg A tp, unless E is isomorphic to the initial object 0; 
(vi’) there are arrows p: C — E and q: D > E in € such that 
(p,q): C+ D > Eis epi, while fp < gp and gq < fq. 


Conditions (i’)-(v’) are the familiar conditions for a partial order (with 
distinct smallest and largest elements) on the set of arrows E — I. But 
notice that the familiar condition “f < gor g < f” for a total order holds 
only on an “epimorphic” cover of E, as in (vi’). We leave the equivalence 
of these conditions with those in (i)-(vi) as an exercise. Readers who 
wish to avoid the Mitchell-Bénabou language at this point may take 
these last conditions (i’)-(vi’) as a definition of an order I = (J, R,6,t) 
in a topos €. 

These conditions (i’)-(vi’) can be stated more directly in terms of 
the structures RI x I and b, t: 1=3T, as follows. 


Lemma 1. The elements (I, R,b,t) form an order in the topos E iff 
the following conditions hold: 


(i) the diagonal A: I > I x I factors through RI x I; 
(ii) the subobject R x R = (I x R)N (Rx I) CI? has the property 


(™1,73) 


that R* R13 ———5 I? factors through R— 1°; 

(iii) the intersection of RI? and R— I? —+ I? is (contained in) the 
diagonal; here + = (2,7) is the twist map; 

(iv) bx I andt x I: 1x I—I-x I factor through R; 

(v) the diagram 


0 ———> 1 


is a pullback; 
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- 
(vi) the inclusion i: RJ? and the composition R—I? — I? to- 
gether form an epimorphic family in €. 


Proof: The equivalence between (i)-(v) and (i’)-(v’) is an instance 
of the standard correspondence between a structure on an object J, and 
a corresponding structure on each hom-set Hom¢(E, J), as discussed in 
§IV.8. For example, condition (ii) of the lemma follows from condi- 
tion (ii’) above, by taking E = R* R— I, and f, g, h: E I to be 
the compositions of the inclusion R * R— I? with the three projections 
I —, I. And conversely, assume (ii) of the lemma, and suppose given 
f, 9, h: E 1 such that both (f,g) and (g, h) factor through R= I?. 
Then (f,g,h): E — I? factors through I x R and through R x I, hence 
through R * R. Therefore, (f,h) = (7,73) 0 (f,9,h) factors through R. 

To see that (vi) of the lemma implies (vi'), assume (vi) and suppose 
given f, g: E =I. Let C and D be the pullbacks as in 


C——>R D——_+ R 

Lo a 
P 

oa Ga) °, cass me “@An 


Since (g, f) = 7° (f,g), the arrows C — E, D = E are the pullbacks 
along (f,g) of the epimorphic family in (vi), hence are epimorphic. The 
converse implication (vi')=(vi) also readily follows, by taking E = R 


and f, g: E I to be the composites R— I? —> I (i = 1,2). 


Example 1 (Presheaf Topoi). Consider the presheaf topos 
Sets©”” associated with a small category C. Let I be an object of 
Sets”, let R C I x I be a subobject, and let 6, t: 1 J be two arrows 
in Sets©”. Evaluation of these functors J and R at each object Ce C 
yields a set I(C’) with a relation R(C) C I(C) x I(C) on this set and two 
elements bc, tc € I(C). We claim that (J, R, 5, ¢) is an order in the topos 
Sets” iff, for each object C' € C, this structure (I(C), R(C), bc, tc) 
is an order in Sets. Indeed, this follows from Lemma 1, since limits 
and colimits in Sets©”” are computed pointwise (and, in particular, an 
arrow in Sets©’” is epi iff it is pointwise a surjection of sets). 

As a special case, consider as in §7(9), the representable object 


V = y([1]) = Hom(-—, [1]) (3) 


in the topos (Ssets) = Sets®””. For each n, the Hom-set V, = 
Homa ((n], [1]) is simply the set of increasing sequences (0, ...,0,1,..., 1) 
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of n+ 1 numbers, all either 0 or 1. This set V, has an evident “point- 
wise” linear order, with smallest element b, = (0,...,0) and largest 
t, = (1,...,1). In other words, V,, for “vertices” is the ordered list of 
vertices of A”+!, so has the structure of an “order” in Sets. More- 
over, this structure is evidently natural in n, in the sense that an arrow 
a: [n] — [m] in A induces a morphism of orders a*: Vin — Va. Thus, V 
for “vertices” is an order in the topos of simplicial sets. We will prove 
below that it is in fact the “universal” order. 
As noted above, one may visualize this simplicial set 


Va = Vert(A"t?) 


as the ordered list of the n + 2 vertices of the standard (affine) simplex 
A”*!; then, in particular, the degeneracy maps of these (affine) simplices 
induce the face operators on the vertex sets, as in §7(4b) above, by 


O75: Att — A” => d; = Vert(oi): Van - Va-1, VSO ls 
Similarly the affine face maps induce the degeneracy operators 
6: A714 A" > 5, = Vert(e,): Vn-2 > Va-1, i=0,...,n. 


The face and degeneracy identities then hold, so this gives a second 
description of the simplicial structure of V. 


Example 2. Consider the topos Sh(X) of sheaves on a space X. 
The sheaf C'o,1) of germs of continuous functions from X to the unit in- 
terval, with natural order <, satisfies Axioms (i)-(v) for an order-object 
in Sh(X), but not the last Axiom (vi): indeed, given two continuous 
functions f, g: U — [0,1] on an open subset U C X, there is in general 
no open cover U = VUW such that f < gon V and g < f on W. How- 
ever, there is an evident cover of U by closed sets V and W for which this 
holds. Thus, we are led to consider the partial order of all closed subsets 
of X (ordered by inclusion), with the Grothendieck topology J on this 
category given by locally finite covers. This means that a family {F,} of 
closed subsets of a given closed set F' defines such a cover {F,} € J(F) 
iff F = (J F; and each point in F' has a neighborhood which meets only 
finitely many F;’s. Then the continuous functions Cont(F, [0,1]) form a 
sheaf on this site of closed subsets of X, with the natural structure of 
an order (in the topos of all such sheaves on the site of closed subsets of 
[0, 1]). 


Having defined orders I = (I, R,b,t) in any topos €, we define mor- 
phisms I — I’ between such orders to be arrows ¢: I > I’ in € which 
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respect the order relation as well as the bottom and top elements; i.e., 
as arrows such that there are commutative diagrams 


This defines the category “Orders(€)” of orders in the topos €. 


Lemma 2. Each geometric morphism f: F — € induces a functor 
f*: Orders(€) — Orders(7) sending an order (I, R,6,t) in € to the 
order (f*(I), f*(R), f*(6), F*(t)) in F. 

Proof: We have to show that if (J, R,6,t) is an order in €, then 
f*(Z) is an order in F, with order relation f*(R) — f*(I x I) & f*(I) x 
f*(Z) and with bottom- and top-elements f*(b), f*(i): 1 = f*(1) - 
f*(Z). But this is clear from Lemma 1, since the inverse image f* (being 
a left exact left adjoint) preserves finite limits and arbitrary colimits 
(hence also epimorphic families), and thus f* preserves the conditions 
(i)-(vi) of Lemma 1. 

The standard simplices A" were constructed from the linear order 
of the real numbers in Sets. Similarly, an order (J, R,6,t) in a topos € 
can be used to construct for € a “standard simplex” functor there, 


MA ane. (4) 


defined exactly as the standard simplex functor A*: A — (Spaces) 
considered in §7 [cf. (4), (8)]. More precisely, for each n the standard 
n-simplex for the order I is the subobject of the “cube” I” 


Af = {(#1,---,2n) | #1 <+++< ay, }-—— I” (5) 
defined from the relation R as the intersection of the subobjects 

TE x Rx [272-8 JF ye [2 ym -2-k = gn 
(for k = 0,...,n— 2). In other words, A? — I” is the unique subobject 
with the property that an arrow f = (fi,...,f,): E — I” from any 


object E factors through Af iff f; <--- < fy; or: 


Home (E, AT) = {(fiy- ofa | fit Bod, fi<---< fa}. (6) 


460 VII. Classifying Topoi 


This defines the functor Af of (4) on objects. For an arrow a: [n] — [m] 
in A, consider the function @: {0,...,m+1} — {0,...,2 +1} as in 
§7(16b), and use it to define a map of “cubes” 


1%; I" 4 ]7™ 


in terms of the m projections 7;: J” — I as the unique map with 


~ ~ b 

a; 01l® are if 0 < aj) < n4+1 while 7; 0I% = 1" 51 — Tif 
~ t 

a(j) = 0, and 1;01% = 1" 41 -+T if a(j) =n +1. Then, since @ is 

order-preserving, there is a factorization 


I” Ie [™ 
| | (7) 
AP --za,-> OF 


as follows readily from (6). This defines A}: A — € on arrows; the 
definition is an exact copy of the construction of §7(3), §7(17) for spaces. 

Exactly as for spaces, this functor Af: A — € gives rise to an ad- 
joint pair of functors by the general Hom-tensor adjunction of Theo- 
rem VII.2.1 [bis]. The right adjoint is the “singular complex functor”, 
Sr, formed from the given order I: 


S 7: E — (Ssets), (8) 

defined for any object E € € and any n > 0 as in §7(18) by 
$1(E)y = Home(A?, B), (9) 
with restriction maps a*: S}(E)m — S}(E)n for a: [n] — [m] given 
by composition with A? as just defined in (7). The left adjoint is the 

“geometric realization functor”, 

|—|r: (Ssets) — € (10) 
defined, for any order J and any simplicial set S, as the tensor product 
[S| = S@,(A}). (11) 
Lemma 3. Geometric realization is natural in €, in the sense that 
for any order I in a cocomplete topos € and for any geometric morphism 


f: F — €, there is for any simplicial set S a canonical isomorphism 


|S| g(r) ——> f*([S|z). 
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Proof: First, since the inverse image functor f* preserves limits, 
and hence the monomorphisms (5), and since f*(I) is an order in Ff, 
there is a natural isomorphism 


FP o At & Abe: A > F. 


Furthermore, f* preserves colimits, and therefore commutes with tensor 
products; so for any simplicial set S there is an isomorphism (natural 
in S) 

F*(S@a AF) = S@a(f* o Af). 
These two isomorphisms together yield the isomorphism required in the 
lemma, by the definition (10) of geometric realization. 


The following somewhat involved proposition shows that this adjoint 
pair of functors (8), (10) constitutes a geometric morphism € — (Ssets). 


Proposition 4. For any order I in a cocomplete topos €, the geo- 
metric realization functor |—|;: (Ssets) — € is left exact. 


Proof: By Theorem VII.9.1 (flat = filtering), it suffices to show that 
the functor Af: A — € is filtering. So we'll check that conditions (i')- 
(iii’) for a filtering functor of Lemma VIL8.3 hold. 

First, since the standard 0-simplex AQ is the terminal object 1 of E, 
condition (i’) is clearly satisfied. 

Condition (ii’), “join any two objects” now refers to two objects [m] 
and [n] for the functor A* and requires the join of any two generalized 
elements F and G of Af( —), given as maps 

te ne 

from an object E € €. By the definition (6) of these arrows, F = 
(fi,--+)fm) and G = (g),...,9n), where f;, g; are arrows E > I in € 
such that f; <---< fm and g, <-+-: < gn. Together they give an arrow 
(F,G): E—I™*", To factor this through A;(m +7) as in (6) we now 
have to interchange some of the f,’s and g;’s so as to put them back in 
the “right” order. Thus if we know that g < f we switch (f,g) to (9, f) 
and so on to give a permuted sequence 


(fis-+-3 fms G1y-++5 Gn) 2 (hi, he, .+-,hn4m) (12) 
of functions h: E > I with hy <---< Anim. We write 
fi = huwy, 9) = hog), a=1,...,m; j=l,...n. (13) 
Here the permutation is for convenience represented by functions 


u: [m] > [m+ nl, v: [n] + [m+n], (14) 
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where for later use we have also set u(0) = 0 and v(0) = 0. 

But, hold on, we may not know that g < f! Condition (vi’) states 
only that there is an epimorphic pair p: C — E, g: D — E such that 
gp < fp on C, while the opposite, fp < gq, holds in D. Iterating this 
for all pairs (f,g) will yield a finite epimorphic family 


pe: Eg 2 E 


so that we know the “right” order of all the composites f;p¢, gj;p¢. For 
each € we now have the equations (12), (13), and (14) above, where u, 
v and h now depend on €, while h: Eg — I and f, and g; in (13) are 
replaced by fi ° pe, 9; 0 pe. Now hy <-+-- < hntm holds in Hom(£,, J), 
so these arrows collectively determine 


H: Ey > Atim+n] = AT" GS I™", 


The arrow u = ug of (14) may be regarded as an arrow [m+ 1] — 
[m+n +1] sending m+1 to m+n-+1; then it lies in the set Hom* 
defined in §7(13). Thus, w~ = wu for each €, and similarly for v. Now 
the equations (13) just above yield the commutative diagram 


Fee IC Tag 


“| [ise 


Aj[m +n] ——>.AG lm] x AGfn 
(Aé AS) 
Since the family pe is epimorphic, this is exactly the condition (ii’) of 
Lemma VIL8.3—the condition “joining” by H the two given generalized 
elements F and G. 

Finally, for condition (iii’), “equalize any two parallel arrows”, sup- 
pose given arrows a, 3: [n] — [m] in A. Consider all F: E > Ag{[n] 
in €, such that Af o F = Ae o F. We are required to find a “cover” 
of E; now E = 0 is covered by the empty family, so we can assume 
that EF #0. As in (6), F can be identified with a sequence of arrows 
fi: E — I, (i = 1,...,n), for which f; < +-- < fn. Consider the 
functions @, 8: [m +1] — [n + 1] leaving 0 fixed and sending m +1 
to n +1, corresponding to a, 8 as in §7(14) above. Then if we write 
fo = bg, fnui =te: E 1-1, for convenience, the assumed equality 


AfoF= A? o F means that Fx) Fe fa) for each j = 0,...,m+1. 


Thus if i € [n + 1] is such that @(j) <i< A(3), or BG) <i< a(7), one 
must have 
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by Axiom (iii’) for orders. Now consider the finite linear order ob- 
tained from [n + 1] = {0,...,n +1} by collapsing each of the intervals 
[a(7), B(Qy)], or [B(3), &(4)], to a point. This yields a new finite linear 
order L with an order-preserving quotient map 


a: [n+1]—L, 
such that +@ = 7G. Moreover, F yields the sequence of arrows (bg = 
fo. fi,--+>fns fn+1 =te), while each f,: E — I. This may be viewed as 
an order-preserving function f: [n + 1] > €(E,I), which by (15) must 
factor through the collapsed interval L by a map @, as in the commutative 
diagram 


QR 


[m + 1] ——3 [n+ 1] > L 
|? M5 (16) 
E(E, 1). 


ww 


Notice that since E % 0 and since f factors through 7: [n+1>L, 
the order LZ must have at least two distinct points; for otherwise bg = 
fo = fn4i = te, contradicting the axiom —(b = ¢) for orders. Thus, 
up to isomorphism, the linear order L must be of the form [é+ 1] = 
{0,...,@+1} for some integer £ > 0, so (16) yields a commutative 
diagram in €, 


At AT e 
nr 
— Ay —— A; 


Nee 


E 


where 77 : [£] — [n] corresponds to 7: [n + 1] — [€+1] & L as in §7(14). 
In other words, 7~ serves to “equalize” the arrows given by a and £, 
so that condition (iii’) for a filtering functor is satisfied. This completes 
the proof of the proposition. 


Theorem 5. The topos (Ssets) of simplicial sets is a classifying 
topos for orders, with universal order V = Homa(-— ,[1]). More explic- 
itly, for any cocomplete topos € there is an equivalence of categories 


Orders(€) — Hom(E, (Ssets)). 


This equivalence associates with an order I = (I, R,6,t) € E the geomet- 
ric morphism € — (Ssets) whose inverse image functor is the geometric 
realization functor |— |: (Ssets) — €, while the direct image functor is 
the singular complex functor S;. 
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Proof: The representable object V is an order in (Ssets), as pointed 
out in Example 1 above. By Lemma 2, each geometric morphism f: €& > 
(Ssets) yields an order f*V in €. Moreover, one readily sees that for two 
such geometric morphisms f and g, any natural transformation a: f* > 
g* yields a morphism of orders ay: f*(I) — g*(J). Thus, one obtains a 
functor 

frof*y, Hom(E€, (Ssets)) — Orders(€). (17) 


In the other direction, an order J in a topos € yields a geometric mor- 
phism, call it Realy: € — (Ssets), as in the statement of the theorem: 
the inverse image functor Real; is the functor |—|;, geometric realiza- 
tion w.r.t. I (cf. (10), (11)]; while the direct image functor (Realy), is 
the singular complex functor, as in (8) and (9). This indeed defines a 
geometric morphism since realization is left exact (Proposition 4). No- 
tice also that a morphism of orders I — I’ in € induces first a natural 
transformation A} — As, between the corresponding functors A — €, 
and then, by functorality of the tensor product, a natural transformation 
Real; — Real;-. Thus one obtains a functor 


Real: Orders(€) — Hom(€, (Ssets)). (18) 


We now show that these two functors (17) and (18) are mutually 
inverse, up to a natural isomorphism, and this for any order I in €. 
This will use a standard property of the tensor product—“the tensor 
product with the ground ring is the identity”. More specifically, as in 
§VII.7(4), this states that the tensor product with Yoneda amounts to 
evaluation: 


y(C)@A& A(C). 


Now the given order J is by (5) the standard 1-simplex of €, so that the 
functor A$ takes [1] € A to I. Moreover y[1] is by (3) of Example 1 the 
linear order V of (3) in Ssets. Therefore 


I = Aj((1]) = y[1] @a Ap = VAS, 
so by the definition of the geometrical realization functor 
IT2=V@Asz=|V[7 = Real7(V). 
This shows that starting with an order I in € and producing first a 
geometric morphism by (18) and then a new order E by (17) gives back 
the same order J, up to isomorphism. 


For the other composite of (17) and (18) we use the isomorphism 


S=S@ny, S: A® _, Sets (19) 
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for any simplicial set S, with y the Yoneda embedding. For recall from 
Chapter VII that geometric morphisms f: € > Sets©”” correspond to 
flat functors A: C — € into any cocomplete topos €. Specifically, given 
such an f, the corresponding flat functor A is 


froy: C5 Sets©” — E, 


while given an A, the inverse image functor f* is the tensor product 
~ @c A. In particular, the identity geometric morphism Sets” > 
Sets©”” corresponds to a Yoneda embedding y: C — Sets©”. There- 
fore, for any presheaf R on C 


R= (id)"R & R@cy. 


This yields (19) for C = A. 

Now return to the “other way around” in composing (17) and (18). 
We claim that in (19) the Yoneda embedding y is itself of the form of a 
“standard simplex” functor A;: A — € for the particular order J = V— 
as will be proved (Lemma 6 below) from the explicit description of V. 
Hence (19) with C = A becomes 


S*S@p At = (Sly. 


Therefore, by naturality of realization (Lemma 3), for any geometric 
f: € — Ssets 


F*(8) = FU Slv) 
= |S p(y). 


This isomorphism is natural in the simplicial set S, hence yields an 
isomorphism of geometric morphisms Real,-(y) = f. This shows that 
the functors (17) and (18) are mutually inverse up to isomorphism; hence 
constitute an equivalence of categories. By Lemma 3, this equivalence is 
natural in €, so the following lemma finally completes the proof of the 
theorem: 


Lemma 6. If for I one takes the order V in the topos (Ssets), the 
standard simplex functor A$: A — (Ssets) is naturally isomorphic to 
the Yoneda embedding. 


Proof: Recall that the simplicial set of “vertices” V: A°P — Sets 
is the representable presheaf given for each n by 


Vp = Homa ((n], [1)) 
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with its natural (“pointwise”) linear order, as in Example 1 above. Un- 
der the bijection §7(12), there is thus an isomorphism, natural in n: 


Vz = Hom) ([2], [n + 1]) 


~{0,...,..+1}, a. 


the latter isomorphism because an arrow w: [2] > [n+1] which preserves 
top and bottom is determined by its value u(1) € {0,...,2 +1}. Thus, 
for the simplicial set Aj?’ = AY}, the isomorphism V,, = {0,...,n +1} 
of (20) yields an isomorphism 


(AT )n = {(21,...,2m) | vi € Va and 2) <---< am}, 
(via (20):) = {(21,...,0m)|O< 21 <---< am <n4+1} 
© Hom ([m + 1], [n+ 1), 
(by §7(13):)  Homa((n], [m]) 
=y[mn. 


These isomorphisms are natural in m and n hence show that A? = y[m] 
as simplicial sets, natural in n, or Ay, = y: A — (Ssets), as asserted in 
the lemma. 


Exercises 


1. A subobject A C E in a topos € is called complemented if 
Av (7A) = E (as subobjects of E). Prove that the topos 
Sets x Sets = Sets/{0,1} classifies complemented subobjects 
of the terminal object; i.e., that for any cocomplete topos, geo- 
metric morphisms € — Sets/{ 0,1} correspond to complemented 
subobjects A C 1 in €. 

2. (a) The Sierpinski space © is the topological space with two 
points 0 and 1, where {1} is open, but {0} is not. Show 
that for any topological space X, continuous maps X — © 
correspond to open subsets of X. 

(b) More generally, show that the topos Sh(*:) of sheaves on 
the Sierpinski space classifies subobjects of 1, in the sense 
that for any cocomplete topos €, there is a natural equiv- 
alence Hom(€,Sh(X)) = Subg(1). 


3. Let G be a fixed group (in Sets). 


(a) Prove that any morphism of G-torsors f: T — T’ in Sets 
is an isomorphism. 
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(b) Generalize (a) by replacing Sets with an arbitrary cocom- 
plete topos. (Hint: one way to do this is to imitate an 
argument for Part (a) using generalized elements as well 


as the isomorphism |],¢q T —+T x T of §2(21).] 


4. By applying Lemma 4.2, give an explicit description of the quasi- 
inverse ce: E — Hom(€,S[U]) for the functor (10) of Theo- 
rem 4.3. 

5. In §5 we have constructed a classifying topos ® for rings. Show 
that R is unique, in the sense that any other topos R’ for which 
there is a natural equivalence Hom(€, R’) = Ring(€) as in §5(3) 
must be equivalent to ®. (See Chapter X, Exercise 2 for a more 
general statement.) 

6. Let X be any object in a topos €, and let €/X —+ E be the 
canonical geometric morphism. Show that for any other topos 
f: F — € over €, geometric morphisms F — £/X over E cor- 
respond to global sections 1 — f*(X). State and prove this 
correspondence in terms of an equivalence of categories. (One 
says: “€/X classifies global sections of X, relative to €”.) 


Let I be the category of finite sets and monomorphisms, as in §II1.9, 
and let J be the atomic topology on I°P (every nonempty sieve is a 
cover for J). Recall from §III.9 that the topos Sh(I°?, J) of sheaves is 
(equivalent to) the topos B(Aut(N)) of continuous Aut(N)-sets. In the 
following three exercises, we will investigate this topos as a classifying 
topos. 


7. (a) For each set S and each object K of I, let Ms(K’) be the set 
of monomorphisms K — S. Show that this defines a flat 
functor Mg: I — Sets, and hence (by the results of Chap- 
ter VII) a point mg: Sets — Sets! of the topos Sets!. 
Show that this in fact yields an equivalence of categories 
m:M —>+ Hom(Sets, Sets"), where M is the category of 
sets and monomorphisms. 

(b) Show that the functor Ms: IPP — Sets is continu- 
ous for the atomic topology J iff S is infinite, so that 
m restricts to an equivalence of categories m: M, 3 
Hom(Sets, Sh(I°P, J)), where M... is the category of infi- 
nite sets and monomorphisms. 
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Let € be a cocomplete topos with the associated geometric mor- 
phism y: € — Sets [so that 7*(A) = [],<41 for any set A]. An 
object S of € is said to be decidable if the diagonal Ag 4S x S 
is a complemented subobject [i-e., S x S = Ag V 7(Asg) holds in 
the lattice of subobjects of S x S]. 


(a) Show that S is decidable iff for any pair of arrows f, 
g: E —S there is a (finite) epimorphic family p;: E; —~ E 
such that, for each i, either fp; = gp; or the equalizer of 
fp; and gp; is zero (the initial object). 

(b) Show that if M: I°P — € is a flat functor then S = M(1) 
is a decidable object. 

(c) For an object S of €, let Mg: IPP — € be the functor 
defined by 


Ms(K) = Mono(7*(K), 5), 


the object of monos 7*() — S. [Thus, for each object 
E € &, an arrow E — Ms(K) is the same thing as a 
mono (a,72): 7*(K) x E — Sx E in €/E.] Show that 
the functor Msg is flat whenever S' is a decidable object. 
Show that (b) and (c) yield an equivalence of categories 
between Hom(€, Sets") and the full subcategory of € con- 
sisting of decidable objects. (Thus, Sets! classifies decid- 
able objects.) 


(d 


~— 


9. Call an object S of E infinite if, given any object E € E and 


10. 


nm arrows (generalized elements) f),..., fr: E — S, there is 
an epi p: E’ —» E and an arrow g: E’ — S such that for each 
1=1,...,n, the arrows g and f;°op have equalizer zero. Using the 
previous exercise, show that there is an equivalence of categories 
between Hom(€, Sets") and the full subcategory of € consist- 
ing of infinite decidable objects. [Thus Sh(I°?, J) = B(Aut(N)) 
classifies infinite decidable objects.] 

Let 2N be the Cantor space, with (product) topology given by ba- 
sic open sets V,, = {x € 2N | x(i) = u(t) for 0 <i < n}, for each 
n > 0 and each finite binary sequence u = (u(0),...,u(n — 1)). 
Let Sh(2N) be the topos of sheaves on the Cantor space. This ex- 
ercise is to show that Sh(2N) classifies arrows N > 2 = {0,1}, in 
the sense that for any cocomplete topos €, there is an equivalence 
of (discrete) categories 


Hom(E€, Sh(2")) = Home (y*(N), 7" (2)) (*) 


where y: € — Sets is the canonical geometric morphism. 
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(a) Write O(2N) for the complete Heyting algebra of open 


— 


~— 


subsets of 2N, with the sup-topology; this is a site for 
Sh(2N). Show that the continuous left exact functors 
A: O(2N) — € into a cocomplete topos € are exactly the 
functors A: O(2N) — Sub¢(1) which preserve finite meets 
and arbitrary sups. Conclude that such functors A are 
determined by their values A(V,,) on all basic open sets 
Vu. 

Show that such functors A: O(2N) — Sub¢(1), preserving 
finite meets and arbitrary sups, correspond to sequences 
A; (i =0,1,2,...) of complemented subobjects A; € 1 in 
€, as in Exercise 1. (Hint: given A, define A; = A({x € 
2N | x(z) = 0}); conversely, given the A;, define A on each 
basic open set V,, by A(V,,) = (AGO) At AMOD)) 
where A“ = A; if k =0 and AW) =A, ifk =1] 

Show that such sequences { A, } of complemented subob- 
jects correspond to arrows y*(N) — y*(2) in €, and con- 
clude that there is an equivalence as stated in (*) above. 


IX 
Localic Topoi 


Among the Grothendieck topoi those of the form Sh(S) for some 
topological space S play a special (and motivating) role. In this chapter 
we consider a related class of topoi—those of the sheaves on a so-called 
“locale”. In the case of a topological space S, a sheaf is a suitable functor 
on the lattice O(S) of open sets of S, where the lattice order is defined 
by the inclusion relation between open sets. Thus the notion of a sheaf 
can be explained just in terms of the open sets of S, without any use 
of its points. Any suitable such lattice (one which is complete, with an 
infinite distributive law) may be taken as defining a modified sort of 
topological space, a so-called “locale”. The beginning sections of this 
chapter provide an introduction to the study of such locales, motivated 
by the topological examples. It will turn out that a topological space is 
essentially determined by its lattice of open sets when that space S has 
the property of “sobriety”, but, beyond that point, spaces and locales 
diverge. 

Any continuous map S — T of spaces factors as a surjection fol- 
lowed by an embedding (of a subspace into T). A corresponding but 
more subtle factorization holds for maps of locales; for this purpose 
(§3), a sublocale is described by an operator called a “nucleus”, closely 
resembling a Lawvere—Tierney topology. 

A “localic” topos is one consisting of the sheaves on a locale. 
These topoi have a number of special properties. For example, any 
Grothendieck topos has such a localic reflection (§5). For topologi- 
cal spaces, one often studies those continuous maps S — T which are 
“open”. The open maps of locales and of topoi are defined in a related 
way (§6), in terms of the effect of a geometric morphism on subobject 
lattices and the corresponding left adjoint. 

For Grothendieck topoi, a functor between sites which “preserves 
covers” in a suitable sense can be used to construct an open geometric 
morphism of topoi. Next, the theorems of Barr and Diaconescu show 
that any Grothendieck topos is the image of a surjective geometric mor- 
phism from sheaves on a Boolean algebra or on a locale. A comparison 
of Barr’s theorem with the Stone space of a Boolean algebra (§10) will 
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serve to prove Deligne’s theorem (§11), stating that a coherent topos has 
“enough” points. 


1. Locales 


For a topological space S, the partially ordered set O(S) of all open 
subsets U C S is a lattice A with all finite meets and all joins (finite or 
infinite); moreover, this lattice A satisfies the infinite distributive law 


UAV Vi =\V(U AN) (1) 


for any element U in A and any family of elements V; in A. Indeed, this 
identity (1) is immediate from the fact that finite meet A and supremum 
\V;, in A = O(S) are given by set-theoretic intersection and union. Ac- 
tually the lattice A = O(S) also has infinite meets, where the infinite 
meet ( V; is the sup of all those open sets U with U < V; for all i, but 
this infinite meet is usually not the set-theoretic intersection of the V;. 
(Moreover, the dual infinite distributive law U v A V; = A(U V V;) need 
not hold for the lattice operations on the open subsets of the space S; 
cf. Exercise 1.] 

Any lattice A with all finite meets and all joins (finite and infinite) 
which satisfies the infinite distributive law (1) will be called a frame, 
while a morphism of frames ®: B — A is defined to be a map of partially 
ordered sets which preserves both finite meets and infinite joins. Notice 
that any frame has a largest element 1 (the empty meet) and a smallest 
element 0 (the empty join); thus a morphism ®: B — A of frames 
satisfies 


6(0)=0, @(1)=1, UAV) =G(U)AG(V), H(\/Ui) =\V BU) 
(2) 


for all elements U;, U, and V of B. 

This definition of a morphism between frames is again modeled on 
properties of open subsets of a topological space: if f: S — T is a con- 
tinuous function between spaces, then the inverse image f—': O(T) > 
O(S) (note the opposite direction!) is a morphism of frames. We remark 
that infinite meets do exist in any frame, but a morphism of frames is 
not required to preserve these infinite meets. [In fact, for spaces S and 
T the morphism f—*: O(T) — O(S) need not preserve infinite meets, 
Exercise 1.] 


Lemma 1. A morphism ®: B — A of frames considered as a map 
of posets has a right adjoint ¥: A — B. 
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Proof: If there is such a right adjoint U it must satisfy the definition 
of adjunction for all U € A and V € B: 


V<WU) iff OV) <U. (3) 
This suggests that Y might be defined as 
WU) =\/{V € B| ®&(V) < U}; (4) 


this indeed satisfies (3) since ® preserves all suprema. 


Notice that Y, as a right adjoint, preserves all meets: 
w(A Ui) = A WU). 


For a continuous map of topological spaces f: S — T, the right 
adjoint to the frame-map f—': O(T) — O(S) is denoted by f,; it sends 
an open set U of S to the largest open set V = f,U of T such that 
fUV <U, asin 


FU =V{V 1 FOV CU}. (4a) 


Notice that f, need not be a morphism of frames (Exercise 1). 

As implied in the introduction, we wish to think of frames as general- 
ized spaces (geometry) rather than as special kinds of lattices (algebra). 
Therefore, we introduce the category of locales, defined to be the oppo- 
site of the category of frames: 


(Locales) = (Frames)°?. (5) 


In other words, a locale—i.e., an object of the category of locales— 
is the same thing as a frame, but an arrow between locales is a mor- 
phism between frames in the opposite direction. We denote locales by 
X,Y,Z,---, and the corresponding frames by O(X), O(Y),.... This 
will make it clear whether we wish to think of a given object as sitting 
in the category of locales or in that of frames; this notation also em- 
phasizes the topological intuition behind frames and locales. Similarly, 
if f: X — Y is a map of locales, the corresponding frame map is de- 
noted by f~!: O(Y) — O(X) and its right adjoint (as in Lemma 1) by 
fx: O(X) — O(Y). Thus in (6) below, the left and the right columns 
present the same data viewed in different categories: 


Locales Frames 
f: XY f': O(Y) - O(X) 


(6) 
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There is an obvious covariant functor “Loc” from topological spaces 
to locales: 


Loc: (Spaces) — (Locales), O(Loc(T)) = O(T); (7) 


thus, for a topological space T, the locale Loc(T) of this space is given 
by the frame consisting of open subsets of T. For a map f: S > T of 
spaces, the locale-map Loc(f): Loc(S) — Loc(T) is given by the frame 
morphism f~!: O(T) — O(S). 

It will turn out that this functor Loc has a right adjoint, sending 
each locale X to the “space” of its “points”. However, this will not 
make (7) an equivalence of categories; for example, some spaces S can 
not be recovered from their open sets (distinct points s 4 t may belong 
to the same open sets). In §3 we will show that the functor Loc can 
be “cut down” to an equivalence of suitable subcategories; in the next 
section we will construct the “points” of a locale. 

Also, observe that a frame A is the same thing as a complete Heyting 
algebra (cHa). Indeed, much as 81.8, one can define implication and 
negation operators for elements U and V of any frame A, by setting 


Us=V=\{WeA|WAUK<V}, 


thus 
W<(USV) ifWAUSYV, (8) 
W<-U iff WAU =O. (9) 


However, a morphism of frames is not required to preserve these Heyting 
algebra operations; in fact if f: S — T is a continuous map of topological 
spaces, f need not preserve => or — (Exercise 1). 


2. Points and Sober Spaces 


The one-point space is the terminal object in the category of topo- 
logical spaces, and a point of a topological space S is the same thing 
as a (continuous) map from the one-point space into S. Analogously, 
we define a point of a locale X to be a map of locales 1 — X, where 1 
denotes the terminal object in the category of locales. 

This can be formulated equivalently in terms of frames, as follows. 
The initial frame is the frame { 0, 1 }, consisting of only a bottom element 
0 and a top element 1. Indeed, if A is any other frame, there is evidently 
exactly one frame morphism {0,1} — A, and this morphism preserves 
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bottom and top. Thus, a point p: 1 — X of a locale X is the same thing 
as a frame morphism to the initial frame 


p': O(X) > {0,1}. (1) 


The map p~! can be conveniently presented in terms of its kernel K = 
{U | p~1(U) =0}, a subset of X with the following properties 
1l¢ kK, 
UAVEK iffUEKorVek, (2) 


VUuiEeK iff U, € K for all i. 


Indeed, these three conditions simply state that p-!: O(X) — {0,1} 
preserves finite meets (i.e., the empty meet and binary meets) as well 
as arbitrary joins, as in the definition of a frame-morphism. Conversely, 
any subset K C O(X) satisfying the conditions (2) determines a frame 
map p-': O(X) — {0,1} by p 1(U) = 0 if U € K, p1(U) = 1 if 
U ¢ K, and hence a map 1 — X of locales; that is, a point of X. 

In turn, a subset K C O(X) satisfying (2) yields an element P of 
O(X), as follows 


P=\/K=(V U)E€O(X). 
UEeK 
By the third condition in (2), each U € O(X) has U < PiffU € kK; 
that is, K = |( P) is the downward closure of P. Consequently, the 
first two conditions on K can be translated into conditions on P: 


1¢P, UAVEP ifU<PorV<P. (3) 


An element P € O(X) satisfying the second condition of (3) is sometimes 
called a prime element of the frame O(X), and a proper prime element 
if also 1 4 P. Thus we have shown: 


Lemma 1. The points of a locale X may be described in any of the 
following three equivalent ways: 


(i) as maps of locales p:1 — X; ie. as frame morphisms 
p ': O(X) > {0,1}; 
(ii) as subsets K C O(X) satisfying the three conditions of (2); 
(iii) as proper prime elements P € O(X); i.e., as elements P satisfying 


(3). 


The following identities serve to pass from one description to an equiv- 
alent one: 


K=Ker(p"); P=\/K; K=\|(P). 
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If S is a topological space, an actual point s € S determines an 
evident “point” of the corresponding locale Loc(S). This point can be 
described variously as a proper prime element S' — {s} of O(S), or as 
a subset K, = {U € O(S) | s ¢ U} of O(S), or as a locale-map 
p,: 1 > Loc(S), that is, as a frame map p,!: O(S) — {0,1} with 


Ps: l1—>Loc(S), p,'(U)=0 iffs¢U. (4) 


The space S' is called sober when this map yields a bijection between 
the points s of the space S and the points p of the locale Loc(S). In 
terms of proper prime elements, this is 


Definition 2. A topological space S is said to be sober iff for any 
open subset P € O(S) such that 


(i) PAS, 
(i) UNV CP 3UCPorVCP (allopenU,V CS), 


there is a unique point s € S with P = S — {s}. 


This definition is often phrased in terms of closed sets: a closed 
subset F' C S is called irreducible if it can not be written as the union of 
two smaller closed subsets; that is, whenever F, and F> are closed sets 
with F = F, U Fy, then F, = F or Fy = F. Clearly, if s is a point of S, 
then {s} is an irreducible closed set. Thus S is sober iff every nonempty 
irreducible closed set is the closure of a unique point [for observe that 
for any open set P C S and its closed complement F = S — P, the set 
P is proper prime, as in (i) and (ii), iff F is nonempty and irreducible]. 

The condition of sobriety relates to more familiar conditions: 


Theorem 3. Any Hausdorff space S is sober; any sober space is To. 


Proof: Consider the mapping s + {s} from points of S to irre- 
ducible closed nonempty subsets of S. By the definition of sobriety, S 
is sober iff this mapping is a bijection. Clearly it is injective iff S' is To. 
Also, if S is Hausdorff and F is nonempty, irreducible, and closed, then 
F must be a singleton subset of S. Indeed, if x, y € F' were distinct 
points in F,, then by choosing disjoint open neighborhoods U, and U, 
in X, we find F = (F — U,) U(F — U,), contradicting the irreducibility 
of F, 

For the relation between sobriety and the T,-axiom (“points are 
closed” ), see Exercise 2. 


3. Spaces from Locales 


The previous sections have shown how each topological space S' gives 
rise to a locale Loc(S) for which the corresponding frame O(Loc(S)) is 
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simply the frame (1.7) of all open subsets of S. This section considers 
the reverse process of obtaining a topological space from a locale. 

Given a locale X, write pt(X) for the set of points of X; that is, 
the set of locale maps p: 1 — X. This set of points carries a natural 
topology for which the open sets are the sets of the form 


pt(U) = {p € pt(X) | p '(U) =1} C pt(X) (1) 


for some U € O(X). The subsets of pt(X) of this form do indeed 
constitute the open sets of a topology since, for U, V, U; € O(X), the 
identities 


pt(U AV) = pt(U) npt(V), 
(2) 


pt(\/ Ui) = Upt(u) 


hold, while for the top element 1x € O(X) clearly pt(1x) is the 
set of all points. The identities in (2) are a direct reflection of the 
fact that any point p: 1 — X is defined to be a morphism of frames 
pt: O(X) — {0,1} and so commutes with finite meets and arbitrary 
sups. For example, the second identity in (2) holds since, for any such 
point p, one has p € pt(\/ U;) iff p 4(\VU;) = 1 iff Vp 1(U;) = 1, and 
this is the case in the lattice {0,1} iff p-!(U;) = 1 for some i; that is, iff 
p € pt(U;) for some i. This shows for each locale X that the set pt(X) 
of its points is a topological space in a natural way. 

Observe also that a map of locales f: X — Y induces a function 
pt(f) from points of X to points of Y, simply by composition: 


pt(f): (pis 1+ X)4 (fop:1+X >Y). (3) 
This function is continuous for the topologies (1) on pt(X) and pt(Y); 


indeed, the inverse image of an open set pt(V) C pt(Y), where V € 
O(Y), is open in pt(X), as follows from the readily verified identity 


pt(f)*(pt(V)) = pt(f*(V)). (4) 
Thus, these definitions constitute a functor 
pt: (Locales) — (Spaces). (5) 


Here is the essential property of this functor: 


Theorem 1. The functor pt: (Locales) — (Spaces) is right ad- 
joint to the functor Loc: (Spaces) — (Locales). 


3. Spaces from Locales ATT 


Proof: For a topological space S and a locale X, the bijective corre- 
spondence between continuous functions g: S — pt(X) and locale maps 
f: Loc(S) > X,ie., frame morphisms f~!: O(X) — O(S), is described 
explicitly as follows. Given g we define f~' for each V € O(X) by 


f"(V) = {8 €8| g(s)-"(V) =1} 
=g *(pt(V)). 


This f—! is a frame morphism, because V + pt(V) sends finite meets 
and arbitrary sups in O(X) to finite intersections and arbitrary unions of 
open sets in pt(X) by (2), and these are in turn preserved by g~! because 
g is continuous. Conversely, given a locale map f: Loc(S) — X, define a 
function g: S — pt(X) as follows. For a point s € S, let g(s): 1 + X be 
the locale map given for V € O(X) by g(s)-1(V) =1 ifs € f-1(V), and 
g(s)_'(V) = 0 otherwise. This yields a continuous map g: S — pt(X), 
since the inverse image of an open set pt(V) under g is g-' pt(V) = 
{s€5| 9(s) € pt(V)} ={s € S| gs) '(V) =1} = f-*(V), which is 
an open subset of S. It is now a straightforward matter of spelling out 
the definitions to verify that these operations g++ f and f + g are mu- 
tually inverse bijections Hom(S, pt(X)) = Hom(Loc(S), X). Since these 
bijections are natural, this proves the theorem. 


Next consider the unit and the counit of this adjunction. For a 
topological space S, the unit 


nH: S — ptLoc(S) 


is the obvious map sending an element s € S (an “actual” point of S) 
to the corresponding point 7(s) = p,: 1 — Loc(S) of the locale Loc(S), 
as described in (4) of the previous section. 


Proposition 2. For any topological space S, the following are 
equivalent: 


(i) S is sober; 
(ii) the unit of the adjunction n: S — pt Loc(S) is a homeomorphism; 
(iii) there is a homeomorphism S = pt(X) for some locale X. 


Proof: (i)=>(ii) By the remark before the definition, S is sober iff 
7: S — ptLoc(S) is a bijection of sets. But each open subset U C 
S is also an element of O(Loc(S)), hence gives an open set pt(U) C 
pt(Loc(S)), and n(s) € pt(U) iff n(s)-1(U) = 1 iff s EU. So if nisa 
bijection of sets, then 7 is not only continuous but also an open map, 
since pt(U) is the image of U under 7. Thus 7~! is continuous and hence 
the bijection 7 is a homeomorphism. 

(ii)=>(iii) is immediate. 
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(iii)>(i) Let X be any locale. We need to show that its space of 
points pt(X) is sober. By definition, the open subsets of pt(X) are 
precisely the subsets of the form pt(V), where V € O(X). Suppose such 
an open subset pt(P), where P € O(X), is proper prime and so satisfies 


(a) pt(P) 4 pt(X), 
(b) for any U, V € O(X), 
pt(U) Nn pt(V) C pt(P) = pt(U) C pt(P) or pt(V) C pt(P), 


as in Definition 2.2. We have to show that there exists a unique point 
o:1— X of X such that pt(P) = pt(X) — {¢}. This identity for ¢ 
means that an open subset of pt(X) is contained in pt(P) iff it does not 
contain the point ¢; in other words, for any V € O(X) we must have 


pt(V) Cpt(P) iff @ *(V) =0. (6) 


Clearly, there can be at most one frame-morphism ¢7!: O(X) — {0,1} 
which satisfies (6). To see that there is at least one, regard (6) for each 
V as defining a function ¢~!: O(X) — {0,1}, and check that it is a 
morphism of frames. Indeed, for the top element 1 of O(X) we have 
pt(1) Z pt(P) by (a) above, so @-1(1) = 1. Moreover, for any two el- 
ements U, V € O(X) one has by (2) that pt(U AV) = pt(U) npt(V), 
so (b) above states precisely that @ (UAV) =¢ 1(U)N@ 1(V). Fi- 
nally, for a family U; of elements of O(X), one has \V¢~1(U,) = 0 iff 
¢@ 1(U;) = 0 for all indices i iff pt(U;) C pt(P) for all i, by (6), iff 
pt(V U;) = Upt(U;) € pt(P) iff 6 1(VU;) = 0; so ¢! also commutes 
with suprema. This shows that ¢~! is a frame morphism, and so com- 
pletes the proof. 


The functor pt from locales to spaces is by no means faithful. In 
fact, there are many locales which have no points at all. (Some typical 
examples of “pointless locales” are given in the exercises.) A locale X 
is said to have enough points (or to be spatial) if elements of the lattice 
O(X) can be distinguished by points of X. Specifically, this means 
that for any two distinct elements U, V € O(X), there exists a point 
p: 1 — X such that p-1(U) # p-'(V). Equivalently, X has enough 
points iff, for any U, V € O(X), 


pt(U) = pt(V) > U=V. (7) 


This property of a locale X of having enough points relates to the counit 
e: Loc pt(X) — X of the adjunction of Theorem 1. By the construction 
of the adjunction, this locale map « is defined, for U € O(X), by 


«*(U) = pt(U) C pt(X). (8) 


Proposition 3. For any locale X, the following are equivalent: 
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(i) X has enough points; 
(ii) the counit e: Loc pt X — X is an isomorphism of locales; 
(iii) X = Loc(S) for some topological space S. 


Proof: The implications (ii)=(iii)=>(i) are clear. To prove (i)=(ii), 
notice that the frame homomorphism «~!: O(X) — O(Locpt(X)) = 
O(pt(X)) is always surjective, since by definition the open sets in pt(X) 
are the sets pt(U) for U € O(X), hence are those in the image of «~!. 
But (7) shows that if X has enough points then ¢~! is injective, hence 
an isomorphism of frames. Therefore in this case ¢ is an isomorphism of 
locales. 


Corollary 4. The adjunction Loc: (Spaces) =(Locales) : pt of 
Theorem 1 restricts to an equivalence of categories between the full 
subcategory of those locales with enough points and that of those spaces 
which are sober. 


Proof: By Proposition 2, the image of the functor pt is (contained 
in) the subcategory of sober spaces; and by Proposition 3, the image of 
the functor Loc is contained in the subcategory of locales with enough 
points. So the functors restrict to an adjunction on these smaller sub- 
categories. But again by Propositions 2 and 3, the unit and counit of 
this restricted adjunction are isomorphisms. So there is an equivalence, 
as asserted. 


The equivalence of categories stated in Corollary 4 and obtained from 
the adjunction of Theorem 1 is actually an application of the general 
process already described in Lemma II.6.4 of “cutting down” certain 
adjunctions to equivalences. 

This equivalence may also be regarded as a duality between spaces 
with enough points and certain frames (those “with enough points”). 
To see this, first write the adjunction of Theorem 1 in terms of frames 
as 

O: (Spaces) ——— (Frames)°? : pt (9) 
(left adjoint on the left). Now consider the two-element set 2 = {0,1} as 
a space-the so-called Sierpinski space, with 1 as the only open point. 
Alternatively, consider the same two-element set {0,1} as the initial 
frame—the two-point lattice. Next, any open set U in a space S may 
be regarded as a continuous map S — 2; namely, as that map with U 
the inverse image of the open point 1 of Sierpinski space. Since Loc(S) 
is the locale of open sets of S we can write O(.S) as a hom-set 


O(S) = Homgpaces(S, 2); (10) 


here 2 is regarded as a space in forming this hom-set, while the lattice 
structure on the hom-set is constructed by pointwise lattice operations, 
using the lattice structure of 2. 


480 IX. Localic Topoi 


On the other hand, for a frame A, pt(A) consists of the frame mor- 
phisms to 2 
pt(A) = Homprames(A, 2). (11) 


In this case the topology on the sets pt(A) comes from the topology 
of the Sierpinski space 2 by the familiar construction of the “compact 
open” topology on a function space 24. Specifically, a subbase for the 
topology consists of the sets N = N(C,U) determined by a compact set 
C of A and an open set U of 2 as 


N=N(C,U)={f|f(C) CU}. 


Here we use the discrete topology on the set A, so that a compact set C’ 
might as well be a single element a € A, while the only nontrivial open 
set U in Sierpinski space 2 is {1}. Then the set N for A = {a} is exactly 
the set {f | f: A — 2, fa = 1}, which is one of the sets already used 
to define the topology on pt(A) in (1) above. 

The duality of Corollary 4 between frames with “enough points” and 
spaces which are sober can then be constructed by two hom-sets (10) 
and (11) into the object 2, which is both a locale and a space (with the 
lattice operations continuous in the topology). Isbell has proposed to call 
such an object 2 a “schizophrenic” object for the two categories. There 
are other well-known dualities arising from such objects which have two 
structures (“commuting” with each other). Here are two examples: 


e Pontrjagin duality between discrete and compact Hausdorff 
abelian groups; the schizophrenic object is the circle R/Z. 

e Stone duality between Boolean algebras and clopen sets in a to- 
tally disconnected compact Hausdorff space. Here the two-point 
set 2 is again schizophrenic: as a space with both points taken to 
be open, it is totally disconnected and compact Hausdorff. 


4. Embeddings and Surjections of Locales 


For topological spaces, subspaces and surjections of spaces are de- 
fined in terms of points. For locales, there may not be (enough) points 
at hand so that the descriptions of “sublocales” and of “surjections” of 
locales must be more subtle. Consider a map f: T — S of topologi- 
cal spaces, and the corresponding frame morphism f—!: O(S) > O(T), 
given by the inverse image. If f is a surjective map, then for any open 
set U C T we have ff—'(U) = U, so f~!: O(S) — O(T) is clearly 
an injective morphism of frames. The converse also holds, provided S$ 
is a Ti-space. Indeed, if S is T; and f~': O(S) — O(T) is injective, 
then for any point s € S, the set S — {s} is open and distinct from S; 


4. Embeddings and Surjections of Locales 481 


hence f—~'(.S — {s}) 4 f-1(S) =T, so there must be a point t € T with 
f(t) = s. Thus the surjectivity of a map of topological spaces can (under 
mild separation conditions) be expressed in terms of the injectivity of 
the corresponding frame homomorphism. 

The fact that a map f: T — S of topological spaces is an embedding 
can similarly be expressed in terms of the frame morphism f—!: O(S') = 
O(T). For an embedding f: T— S of a subspace T, the open subsets 
of T are exactly the sets of the form T ™U where U is open in S; thus 
f—1: O(S) > O(T) is a surjection of frames. The converse is again true 
under a separation condition: if T is a Tp-space (in particular, if T is 
sober) and f~!: O(S) — O(T) is onto, one easily shows that f: T— S 
must be an injective map of spaces such that a subset A C T is open 
iff A = f—!'V for some open subset V C S. In other words, if T is a 
To-space then a map f: T — S is an embedding iff f~!: O(S) — O(T) 
is a surjective frame morphism. 

These indications suggest the following definition for maps of locales: 


Definition 1. A map f: Y — X of locales is an embedding 
(respectively a surjection) iff the corresponding morphism of frames 
f-1: O(X) — O(Y) is surjective (respectively, is injective, ie., one- 
to-one). 


In brief, a surjection on the “open sets” means an embedding for the 
locales, etc. 

With this definition, the observations above can be rephrased as 
follows: a map f: T — S of topological spaces is an embedding (respec- 
tively, a surjection) of spaces iff the map Loc(f): LocT — LocS is an 
embedding (respectively, a surjection) of locales, this provided S is T; 
(respectively, T is To). 

Notice that it follows immediately from the definition that a map 
of locales which is both an embedding and a surjection is necessarily 
an isomorphism. Notice also that for maps of locales f: Y — X and 
g: Z > Y, if fog is a surjection then so is f, and if fog is an embedding 
then so is g. 

Recall that for a map f: Y — X of locales, the corresponding frame 
map f—!: O(X) — O(Y), considered as a map of posets, has a right 
adjoint f,: O(Y) — O(X) (Lemma 1.1). The unit and counit of this 
adjunction between posets state that U < f,f~1U for any U € O(X), 
and that f-'f.V < V for any V € O(Y). Moreover, the triangular 
identities for the adjunction reduce to the equalities 


Pohl =f 3. FE Hake (1) 


Lemma 2. Let f: Y — X bea map of locales. The following three 
conditions are equivalent: 
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(a) f is a surjection of locales (i.e., f—' is one-to-one); 
(b) f.f-} = id: O(X)  O(X); 
(c) fs: O(Y) — O(X) is a surjection of posets. 


Also, the following are equivalent: 


(a’) f is an embedding of locales (i.e., f~) is onto); 
(b') fo" fs = id: OY) > OY); 
(c’) fe: O(Y) > O(X) is injective (one-to-one). 


By way of motivation, the reader may wish to check the results in 
the case of a continuous map T — S of spaces with suitable separation 
properties. 

Proof: (a)=(b) If f is asurjection, then the first triangular identity 
in (1) implies that f, f—~1 = id. Next, (b)=>(c) is clear. Finally, if f, is 
surjective then the second triangular identity in (1) yields f,f—! = id, 
hence f—1 is injective; so (c) implies (a). The equivalence between (a’), 
(b’), and (c’) is proved similarly. 

For the embedding f: S — T of a subspace S of a topological space 
T, the open sets U of the subspace S are usually described (in the “sub- 
space topology”) as the intersections U = TMV with open subsets V of 
T. For agiven U Cc S there may be many such V, so U really corresponds 
to the union W of all such V. Now f,U, by its definition (1.4a), is the 
union of all V with f-!V CU, so the union W satisfies f, f—1'W = W. 
In other words, open sets U of the subspace S are in bijection with the 
open sets W of T fixed under the operation f, f—!: O(T) > O(T). It is 
this description of the subspace topology which carries over to locales. 

Consider an embedding f: Y — X of locales. Thus, by (c’) above, 
fx: O(Y) — O(X) is injective, and again by (1) its image consists of 
precisely those elements U € O(X) which are fixed under the operator 
f«f~1: O(X) — O(X). This operator j = f,f7} is (the underlying 
functor of) the monad of the adjunction f~! 4 f,, while the unit and 
multiplication of this monad are maps which can be expressed by the 
following inequalities, for each U in the poset O(X): 


U < ju, (2) 

jjU < gU. (3) 

Notice that by the functoriality of j = f,f—1, (2) implies that jU < j?U, 
so that (3) is in fact an identity 

PU =U. (3" 

Notice also that, since f, and f—! both preserve finite meets (f, as 

a right adjoint and f—! as a frame morphism), we have for any U, 

U' € O(X), 
JU AU) =jU Aju’. (4) 
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For the subspace topology, we noted above that the open subsets cor- 
respond to fixed points of such a j = f.f—!. An operator 7: O(X) > 
O(X) satisfying the identities (2), (3’), (4) is called a nucleus on the lo- 
cale X. This nucleus determines the domain locale Y. Indeed, as stated 
at the start of this paragraph, if f: Y — X is an embedding of locales, 
then O(Y) is isomorphic to the set of fixed points {U € O(X) |jU =U} 
of the nucleus j = f, f—!. The converse of this observation also holds: 


Proposition 3. Let 7: O(X) — O(X) be a nucleus on a locale X. 
Then the poset of j-fixed points {U € O(X) | jU = U} is a frame, 
and j defines a surjective frame-morphism from O(X) into this frame of 
fixed points. 


The locale corresponding to this frame of fixed points {U € O(X) | 
jU =U} is usually denoted by X;—so the corresponding frame is 


O(X;) = {U € O(X) | jU =U}. (5) 


The proposition then states that the nucleus 7 on X determines an 
embedding of locales 
i: X,~-— X (6) 


given by the frame morphism i~!: O(X) > O(X;) defined by i71U = 
gU. [Its right adjoint i,: O(X,) — O(X) is simply the inclusion: i,U = 
U for all U € O(X,).] Locales of the form X; where j is a nucleus on X 
are called the sublocales of X. 


Proof of Proposition 3: By (4) above, the set O(X;) ¢ O(X) of 
fixed points of j is closed under finite meets. Moreover, if {U, } is a 
family of elements of O(X,), their supremum in O(X;) can clearly be 
computed as 7(\/U.), where VU, is the supremum in O(X). To show 
that O(X;) is a frame we need to verify that finite meets commute with 
arbitrary sups [as in (1) of §1]; in other words, that for V and U, in 
O(X,) the identity V A j(V Ua) =7V(V AUQ) holds. But 


VAG(Y Ua) =IV NG Ua) [since V € O(X;), by (5)] 
=j(VA\VUa) [by (4)] 
= iV AUag) [since O(X) is a frame], 


as required. Moreover, by (3’) the operator 7 yields a surjective map 
from O(X) into O(X;); this map preserves finite meets and arbitrary 
sups, as is evident from the description just given of such meets and sups 
in O(X;). Thus j: O(X) > O(X;) is a surjective frame morphism, and 
the proposition is proved. 
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There is clearly a formal similarity between the definition of a nucleus 
on a locale and that of a Lawvere-Tierney topology on a topos. This 
analogy will be pursued in more detail in §5 below. Here we state only 
a factorization theorem parallel to the one for geometric morphisms in 
§VII.4. As in Theorems 4.6 and 4.8, the theorem can be given in the 
following two parts. 


Theorem 4 (Factorization Theorem, existence). Let f: Y > 
X bea map of locales. Then there exists a nucleus 7 on X for which f 
factors through the embedding i: X; — X via a surjection p: 
f 


Y ——— X 


ed 


The proof of this theorem uses the following lemma. 


Lemma 5. Let f: Y — X be a map of locales, and Iet j be a 
nucleus on X with the corresponding embedding i: X; > X. Then f 
factors through i (necessarily uniquely) iff f~10j = f—. 


Proof: (=) Suppose f factors as f = iop for a map of locales 
p: Y > X; (such a map p is necessarily unique since i} = j is surjec- 
tive). Then for any U € O(X), because 7 is idempotent by (3’), we have 
PU = pti (U) = p4U = phigU = F150. 

(<=) Suppose f~1j7U = f—1U for every U € O(X), and consider the 
restriction p—? of f—' to O(X;) € O(X), so that 

p':O(Xj) > O(Y),  p 'U =f. 
Since O(X;) is closed under finite meets, and a sup in O(X;) is computed 
by applying j to the sup in O(X), the identity f-!j = f—! implies that 
p 1 isa frame morphism. Hence p~' defines a map of locales p: Y — X;. 
Moreover, for U € O(X) we have p-'i-1U = f-1jU = f-1U, so that 
ip = f, as required. 

Proof of Theorem 4: The given map f: Y — X of locales yields 
adjoint functors f~1: O(X) 2 O(Y) : f,. Define 7 as the composite 
j = f.f~1: O(X) — O(X); then j is a nucleus. As before, the unit id < 
f«f—} of the adjunction proves that (2) holds, while (3’) follows by the 
triangular identities, and (4) holds since f~! (as a frame morphism) and 
f. (as aright adjoint) both preserve finite meets. The resulting nucleus 7 
defines a sublocale X; and an embedding i: X; — X [where O(X;) and 
i are as in (5) and (6) above]. Moreover, since f-!j = f-'f, f-! = f-} 
by the triangular identity (1), it follows from Lemma 5 that f factors as 
f = poi where p-!: O(X;) — O(Y) is the restriction of f—!. But clearly 
this restriction of f—' to the set O(X;) of fixed points of j = f, f~! is 
injective. So p: Y — X; is a surjection of locales. 
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Theorem 6 (Factorization Theorem, uniqueness). Let 


f:Y 3 X bea map of locales, while Y —» A» X andY + B—»X 
are two factorizations of f. If v is an embedding and p is a surjection, 
then there exists a unique map g: A — B of locales such that gp = q 
and vg = u, as in the commutative diagram 


y—+ .B 


al 
Pp gL7 v 
a 
se 


A4— Se 


Moreover, if u is also an embedding and q is also a surjection, then g is 
an isomorphism. 


Proof: Suppose the given map f: Y — X factors as vg = f = up, 
as in the statement of the theorem. Now first apply Theorem 4 to the 
map v to get a factorization v =i07r as in 


B—+*— X 


NN! 


where r is a surjection, i is an embedding, and 7 is the corresponding 
nucleus on X. Since v is assumed to be an embedding, r is also an 
embedding, hence an isomorphism. Therefore we may without loss of 
generality assume that r is the identity so that v is the embedding of 
a sublocale v = i: X, — X; thus, v-' = j. Since f = vq, f factors 
through X,— X, so f-1j = f~! by Lemma 5. But also f = up, so 
p-tu-j = p-tu!, hence u-17 = u! because p—! is injective. So, 
again by Lemma 5, u factors uniquely through v: X, — X, say as u = vg. 
Then also vgp = up = vq, hence gp = q since v is an embedding, so that 
v, is one-to-one. Notice that g is an embedding if u is since vg = u, 
and g is a surjection if g is since gp = q. In particular, g is iso if u is 
an embedding and gq is a surjection; in other words, the factorization 
of u as a surjection followed by an embedding is then unique up to the 
isomorphism g. 

For a given map f: Y — X of locales, one usually writes f(X) for 
the sublocale of X occurring in the factorization of f as a surjection p 
followed by an embedding i (Theorem 4), to give the diagram 


y —?» f(x) =X; 
j | ee ae (7) 
Xx, 
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By Theorem 6, f(X) is the smallest sublocale of X through which f 
factors. If f is an embedding then so is p, hence p is an isomorphism. 
Thus up to isomorphism every embedding is of the form X;~— X for 
some nucleus j on X. 


To conclude this section we consider some simple examples of sublo- 
cales which resemble open and closed subspaces of a topological space. 
For a given locale X and an element U € O(X), the set (the downward 
closure) |U = {W € O(X) | W < U} is clearly a frame, and there is 
an evident surjective frame morphism 


UA —: OX) |( UV), WHUAW. (8) 


Let us also write U for the locale given by this frame |U;; i-e., the locale 
U defined by the frame O(U) = |U. Then (8) describes an embedding 


f:U>X, fc(W)=UAW, WeEO(X). (9) 


By the definition of implication as an adjoint, the right adjoint f, of f~! 
is the map V+ U = V, for V < U. It follows that U is isomorphic to 
the sublocale X,, for the nucleus j = f,f~1; i.e. 7W = (U > (UAW)) = 
(U = W) for each W € O(X): 


f 


U»————> X 


e | j=(U+(-)), (10) 


Xx: 


3g? 


Sublocales of this form are called open sublocales of X. In other words, 
any element U € O(X) defines a unique open sublocale of X, usually 
called U > X. 

Corresponding to each element U € O(X) there is also a closed sublo- 
cale of X, denoted by X — U, because it is like the (closed) complement 
of an open subset of a topological space. It is defined by the frame (the 
upward closure) 


O(X -U)=1U ={VEO(X)|U<V}. (11) 


The embedding g: (X — U)—X is given by the frame morphism 
g-}: O(X) — O(X —U) defined by 9 1(W) = WVU. The corre- 
sponding nucleus k on O(X) is described by k(W) = W v U for each 
W € O(X). Indeed, O(X —U) = ie U) is exactly the set of fixed points 
of k: 

Xp=(X—-U)4X, k=(-)vU. (12) 
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As a final example, note that for any locale X the negation operator 
— of §1(9) gives a “double negation” nucleus 


a3: O(X) > O(X). (13) 


Indeed, the defining properties U < =7U = ———-U and ——=(U AU’) = 
=U A—-U’ of a nucleus hold, as observed in §1.8. This gives a sublocale 


X_. > X, (14) 


with the property that the frame O(X_..) = {U € O(X) | --U =U} 
is a complete Boolean algebra. 

Let us briefly consider this example in the special case where X is a 
locale with enough points, so that X is of the form Loc(T) for some topo- 
logical space T. Each subspace R C T gives rise to an evident sublocale 
Loc(R) C Loc(T). Indeed, by definition of the relative topology on R, 
the inverse image function i~! of the inclusion i: RT is a surjection 
i-'; O(T) +» O(R). So Loc(i): Loc(R) — Loc(T) is an embedding of 
locales. 

On the other hand, in general, not every sublocale of Loc(T) comes 
in this way from a subspace R C T. For example, if T is a Hausdorff 
space without isolated points, then the sublocale Loc(T)... of Loc(T) 
does not have any points at all. To see this, suppose to the contrary 
that p: 1 > Loc(T)__, is such a point, and write u: Loc(T).... > Loc(T) 
for the embedding, so that u-1(W) = AW for any open set W C T. 
Since T is sober (cf. Theorem 2.3), the composition uo p: 1 > Loc(T) 
corresponds to a unique point t of T, in the sense that for any open 
subset W C T, one has (up)"'W = 1 iff t € W. Thus, in particular, 
0 = (up)-“T — {t}) = pu“ (7 — {¢})) = p(T — {4})). But 
the point ¢ is not isolated in the space T, so —(T — {t}) = @, and hence 
—i(T — {t}) =T. Thus p~)(--(T —~ {t})) = 1, a contradiction. 


5. Localic Topoi 


The definition (Chapter IT) of sheaves on a topological space depends 
only on the lattice of open sets of that space, and so extends at once to 
define sheaves on a locale X. Thus an “open” U in O(X) is said to be 
covered by a family {U; | i € I} of opens of X with each U, < U iff 
U=VU;,; thus 


{U, > U |i €T} covers U iff \/U, =U. (1) 


Then sheaves are defined from these coverings to give the category 
Sh(X). More formally, these are the sheaves for the Grothendieck site 
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given by the base (1) on the category O(X), where there is an arrow 
V -U iff V < U. One may check that this is in fact the “canonical” 
topology on O(X); that is, the largest Grothendieck topology in which 
all representable presheaves are sheaves. 

A topos equivalent to one of the form Sh(X) for some locale X is 
called a localic topos. In particular, this includes the sheaves defined 
on any complete Heyting algebra, since such an algebra is a frame (§1). 
Such sheaves can thus be viewed algebraically (with Heyting) or topo- 
logically (on the locale). 

Using a result from the Appendix, one has 


Theorem 1. For a Grothendieck topos E the following are equiva- 
lent: 


(i) E is localic, 
(ii) there exists a site for € with a poset as underlying category, 
(iii) E is generated by the subobjects of its terminal object 1. 


Proof: Since a frame is a poset, (i) trivially implies (ii). 

(ii)=>(iii) Suppose that € = Sh(P,J) where J is a Grothendieck 
topology on a poset P, and write ay: P — € for the process of 
sheafification a following the Yoneda embedding. Now for each p € P 
the map y(p) — 1 is necessarily monic in presheaves, while sheafification 
a is left exact, hence preserves monics. Thus every map ay(p) — 1 is 
monic, hence gives a subobject of 1. But §III.6(17) showed that the 
images of the ay generate the topos €. 

(iii)=>(i) Let the topos € be generated by the subobjects of its 1. 
The category Subg(1) of these subobjects is a cHa by §III.8, so gives a 
locale X with O(X) = Sube(1). Corollary 4.1 of the Giraud theorem, 
to be proved in the Appendix, then shows that O(X) is a site for € and 
so provides the equivalence € & Sh(X). 


Next we show that maps X — Y of locales correspond to geometric 
morphisms Sh(X) — Sh(Y), much as in the case of spaces. First, observe 
that the locale X can be recovered from the topos Sh(X) of its sheaves. 
Just as for sheaves on a space, in §III.8(17) we observed that a subobject 
of 1 in Sh(X) is simply an element U of O(X) (i.e., simply the presheaf 
Hom(-—,U)); in other words, 


Ov x) — Subgnx) (1). (2) 
Now consider geometric morphisms 
E — Sh(Y) (3) 


from a topos € with all small colimits. Corollary 4 of §VII.9 states 
that these geometric morphisms (3) correspond to continuous left exact 
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functors F: O(Y) — €. Because F is left exact and because every 
object of O(Y) is a subobject of 1, the image of F lies in Subg(1), which 
means that F is really a functor O(Y) — Sub¢g(1). But Sube(1) is a 
cHa and left exactness of F means that F' preserves finite meets, while 
continuity means that F' preserves sups. In particular, if the topos € is 
itself localic, say as E = Sh(X) for some locale X, then Sube(1) = O(X) 
by (2) above, so that F: O(Y) — O(X) actually amounts to a map 
f: X — Y of locales for which the corresponding frame map is f—! = F. 
Therefore geometric morphisms Sh(X) — Sh(Y) correspond exactly to 
maps X — Y of locales. 
More formally, define a category 


Maps(X, Y) (4) 


of maps from a locale X to a locale Y, with objects those maps 
f: X —Y and with arrows f — g the natural transformations of func- 
tors O(Y) — O(X). In other words, the category Maps(X, Y) is a poset 
with objects f, g for which f < g precisely when f—1(U) < g71(U) for 
all U € O(Y). Then the preceding discussion can be summarized thus, 
where Hom again denotes the category of geometric morphisms: 


Proposition 2. The functor X > Sh(X) from locales to topoi in- 
duces for any two locales X and Y an equivalence of categories 


Maps( X,Y) —~+ Hom(Sh(X), Sh(¥)). (5) 


More generally, start with an arbitrary cocomplete topos € (not nec- 
essarily Grothendieck). Then the poset Sub¢(1) of subobjects of 1 is a 
Heyting algebra by §IV.6 and §IV.8 and is complete because € has all 
small colimits. Thus € determines a locale Loc € by 


O(Loc €) = Sub¢(1). (6) 


As we have just observed, geometric morphisms € — Sh(Y) as in (3) 
correspond to left exact continuous functors O(Y) — Subg(1) and hence 
by (6) to maps Loc€ — Y of locales. This shows that the functor 
Loc from cocomplete topoi to locales is left adjoint to the “inclusion” 
Y » Sh(Y) of the category of locales into cocomplete topoi. That is 


Proposition 3. For € a cocomplete topos and Y a locale, there is 
a natural equivalence of categories 


Hom(E,Sh(Y)) = Maps(Loc €,Y). (7) 


Loc €—the locale of subobjects of 1 in €—is for this reason some- 
times called the “localic reflection” of €. 
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Next we consider the relation between sublocales and subtopoi. If 
f: € — F is an embedding of topoi while {G; | i € I} is a family 
of objects of F which generate F, then { f*(G;) | i € I} is a gener- 
ating family for €. For, f an embedding means that f, is faithful, so 
a # 3: E — E’ implies that f,a # f.8, so there is an index i and a 
map u: G; > f,E£ with f,(a)ou ¢ f.(8) ou. Transposing along the 
adjunction f* 4 f, gives a map ti: f*G; — E for which at # ii, so 
f*(G;) is indeed a generating family. In particular, if F is localic, and 
so by Theorem 1 generated by subobjects of 1, this shows that €, too, 
is so generated. By Theorem 1 again this proves 


Lemma 4. Any embedding f: € — Sh(X) of a topos into a localic 
topos forces the domain E to be localic. 


This leads to a more explicit result: 


Proposition 5. Let f: X — Y be a map of locales and 
f: Sh(X) — Sh(Y) the corresponding geometric morphism. 
(i) The map f: X — Y is a surjection of locales iff f: Sh(X) > 
Sh(Y) is a surjection of topoi. 
(ii) The map f: X — Y is an embedding of locales iff f: Sh(X) > 
Sh(Y) is an embedding of topoi. 


Proof: (<=) The inverse image functor fr: Sh(Y) — Sh(X) re- 
stricts by the isomorphism (2) above to the map f—': O(Y) > O(X) of 
posets. The same applies to the right adjoint fe and its restriction /,. 
These restrictions provide a diagram 


O(Y) —~-> Sub(1) c Sh(Y) 


ale z|| (8) 


O(X) ——— Sub(1) CSh(X), 

in which both squares commute—one square consisting of the left-hand 
vertical arrows, and the other of the right-hand arrows. Now f a surjec- 
tion of topoi means that f* is faithful, so that f—! is one-to-one; thus, 
f is a surjection of locales, as explained in Definition 4.1; similarly, fan 
embedding of topoi means that f, is full and faithful (§VII.4) and hence 
that f, is injective; in other words, f is an embedding of locales. 

(=>) To prove the converse of either (i) or (ii), factor the given geo- 
metric morphism f by §VII.4, 


sh(x) —24 sh(Y) 
g h (9) 
E, 
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as a surjection g followed by an embedding u. Then Lemma 4 implies 
that the intermediate topos E in (9) is localic and so of the form € = 
Sh(Z) for some locale Z. Then, by Proposition 2, g and u correspond to 
maps go and uo of locales, which form a commutative diagram (g = go, 
u = Uo) 


Ney 
I~ val (10) 


of locales. Then by the first half me of this proof, go is a surjection of 
locales and wo an embedding of locales. 

Now in case the original map f of locales is a surjection, then so is its 
factor uo, which is also an embedding, hence an isomorphism. But then 
u: € = Sh(Z) — Sh(Y) is an equivalence of topoi, so f is a surjection 
since g is. 

Correspondingly, if f is an embedding, then go is an isomorphism 
and hence g is an equivalence. Therefore f is an embedding since the 
second factor u is such. 

This completes the proof of both implications (<>) of the proposition. 
This result means also that the factorization Theorem 4.4 for locales 
corresponds exactly to the factorization Theorem VII.4.6 for geometric 
morphisms. 


Corollary 6. For any locale X, sublocales of X correspond exactly 
to subtopoi of Sh(X); or, equivalently, nuclei on the frame O(X) corre- 
spond exactly to Lawvere~Tierney topologies in the topos Sh(X). 


Proof: By Lemma 4, an embedding € — Sh(X) from some topos € 
forces € to be localic, hence of the form € = Sh(Y) for some locale Y. 
The first part of the corollary now follows by Part (ii) of Proposition 5. 
The second part of the corollary also follows since, up to isomorphism, 
any embedding of locales Y — X is of the form Y = X, — X for 
a (unique) nucleus j (see §4) while, up to equivalence of topoi, any 
embedding € — F into a topos F is of the form Sh,(F) — Ff for a 
(unique) Lawvere-Tierney topology j (see Corollary VII.4.7). 


6. Open Geometric Morphisms 


A continuous map between spaces is open when it carries open sets 
to open sets. A corresponding (but more subtle) notion of “open” ge- 
ometric morphism has many uses, in particular in its connection with 
“first-order” formulas of the language of a topos to be discussed in Chap- 
ter X. To arrive at an appropriate definition of such “open maps” be- 
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tween topoi, we first take a closer look at open maps of spaces and their 
expression in terms of a suitable adjunction. 

Let f: Y — X be a continuous map between spaces, with the cor- 
responding inverse image map f—!: O(X) — O(Y) of open set lat- 
tices. For any two open subsets U C X and V C Y, one clearly has 
V Cc f-l(U) iff f(V) C U. Thus, if f is an open map, the functor 
V + f(V) from open sets in Y to open sets in X yields a left adjoint fi 
to f~}, both as maps of posets: 


fr OY) OX) :f', Aft. (1) 


This property is preserved under pulling back, as follows. 


Lemma 1. For an open map f: Y — X of spaces and an arbitrary 
map g: Z — X of spaces, the pullback 72 of f along g is again open, 


Y xx Z—2 5 Z 


Y fos xX; 
moreover, the adjoints (1) and (12); 4 ™' satisfy the Beck—Chevalley 
condition, in the sense that for each open subset V of Y, 


9 * AV) = (m2)rmy"(V). (2) 


Proof: For two open subsets V C Y and W C Z the subset 
V xx W = {(y,z) |y € V,z € W and fy = gz} is open in Y xx Z, 
and open subsets of this form yield a basis for the “product” topology 
on Y xx Z. To show that mo is open, it thus suffices to prove that 
2 takes such an open set V xx W to an open set in Z. But clearly 
m2(V xx W) = Wng7'f(V), which is indeed an open subset of Z since 
the map f is open. The Beck-Chevalley identity, (2), by definition of 
the shriek, becomes g~!f(V) = mom, '(V), and is a special case of this 
last equality, for W = Z. 

Let f: Y — X be an open map of spaces, and consider now the 
induced geometric morphism for sheaves, 


f: Sh(Y) > Sh(X). 


As shown in Proposition II.2.4, the open subsets of X are exactly the 
subobjects of 1 in the topos Sh(X) (and similarly for Y). Also the inverse 
image f* restricted to f*: Subgncx)(1) — Subgny)(1) takes a subobject 
U C1, ie., an open subset U C X, to the open subset f-'(U) C Y. 
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Thus (1) states that the restriction f*: Subgn¢x)(1) — Subgncy)(1) has 
a left adjoint. 

More generally, identifying sheaves with étale bundles (Corol- 
lary I1.6.3), the inverse image functor f*: Sh(X) — Sh(Y) sends an 
étale map p: E — X to the (étale) pullback f*(E) =Y xx E— Y, as 
in 


¥ x, Rk = Pre 


bo 


Y —__ +X, 


f 


where we have denoted the second projection Y xx E — E by fr. By 
the lemma, this map fg is again an open map of spaces when f is, so 
determines a map (fz): on open sets and yields an adjunction between 
posets 

(fe): O(f* E) == O(E) : fr’. (4a) 


But, under the equivalence between sheaves and étale bundles, open 
subspaces of the bundle EF correspond exactly to subobjects of E in 
the category Sh(X) of sheaves [and similarly for f*(£)|. Thus one may 
rewrite (4a) as the following adjunction of posets: 


(fe): Subsncy) (f*E) ~~ Subshx) (£) ephs (4b) 


The Beck—Chevalley condition of (2) states that this left adjoint (fz); 
in (4b) is natural in E, in the following sense. If a: E’ — E is a map 
of sheaves on X (again viewed as a map of étale bundles over X) while 
f: Y — X, then adding the vertical map a: E’ — E above (3) gives a 
pullback of spaces 

f* E' f Ez! E' 


ro] [ (5) 


Pag te 
for which both fg and fg are open maps. Therefore by Lemma 1 the 
corresponding diagram obtained from the left adjoints (fr): and (fr): 
and the inverse image maps a! and f*(a)~!, 


Sub(f*E) 2 sub(z) 


ro| Jan (6) 


Sub(f*£’) seas Sub(E’), 
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commutes. It is this property for the subobject posets which will lead to 
a suitable definition of an “open” geometric morphism between topoi. 
Let f: F — € bean arbitrary geometric morphism. Its inverse image 
functor f*: € — F is left exact, hence preserves monos, hence induces 
a functor 
fiz: Sube(E) —— Sub-(f*E) (7) 
of posets, sending each subobject A» E of an object E € E to its inverse 
image in F: 
(A> E) > (f" Ar fr). (8) 
Since f* is left exact, this functor must preserve meets. The functor f* 
also preserves sups. Indeed, for two subobjects A — E and B — E, their 
sup AU B is constructed, as in Chapter IV, by factoring A+ B— E as 
an epi followed by a mono: 


A+B>oAUBXE. 


Thus, since the functor f* preserves sums as well as epis and monos, 
one has f*(AU B) = f*(A) U f*(B). Therefore the functor ff of (7) is 
a functor of lattices. 

In fact, more is true: the functor ff preserves all suprema which 
exist in Sube(£), because it has a right adjoint, to be denoted 


(fz)«: Sub(f{*E) —— Sube(E). (9) 


This right adjoint (fr). sends each subobject m: B— f*E in the topos 
F to the pullback along the unit nz: E — f, f*E of the subobject 
f.B- f, f*E in the topos €: 


(fr)«(B) ———> f.(B) 


| a iB iu (10) 
E—— f.f*(B). 


To see that this indeed a right adjoint to the functor (7) between 
posets, consider any subobject A» E in the topos €. By the uni- 
versal property of this pullback (10), one then has A < (fr).(B) as 
subobjects of EF iff the composition A> E — f, f*(E) factors through 
f.(m): f.(B) — f. f*(E), as in the left-hand square below: 
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By the adjunction f* 4 f,, such a factorization exists iff one exists as 
in the top line of the right-hand square above; ie., iff f£(A) < B as 
subobjects of f*(£). Thus A < (fz).(B) iff f%(A) < B, which states 
that (fz). is right adjoint to ff, as asserted. 

In particular, if € and F are cocomplete topoi, then Sub¢(£) 
and Sub;(f*E) are complete Heyting algebras (or frames), and 
fm: Sube(£) > Subs(f*£) is a morphism of frames. 

Each morphism a: E’ > E in € induces by pullback as in IV.8(10) 
a lattice homomorphism a~!: Sube(E) — Sub¢e(E’) (which we write as 
a! to suggest the inverse image in Sets). Since f* preserves pullbacks, 
there is a corresponding commutative diagram of subobject lattices 


Sube(E) —22— Subs(f*E) 


a | [ro (11) 


Sub¢(E’) cae? Subs(f*E’). 
Ef 


Definition 2. A geometric morphism f: F - € is said to be open 
when for each object E in €, the induced map of subobject posets ff 
has a left adjoint (fz): 


(fe): Sube(f*E) ——— Sube(E): fa, 


which is a map of posets and is natural in E, the latter in the sense that 
each arrow a: E' > E in E yields a commutative diagram 


Sub-(f*E) —/2 Sube(E) 


| fe (12) 
Subz(f*E’) = GEor. Sube (E’). 

The latter diagram is of the same form as Diagram (6) for spaces 
above. Thus every open map of spaces Y — X yields an open geometric 
morphism Sh(Y) — Sh(X). (For the converse, see §7, below.) 

Next, recall from the discussion of quantifiers in Theorem I.9.1 and in 
Chapter IV that, for any arrow a: E’ — Ein a topos €, the correspond- 
ing inverse image a~!: Sube(E) — Sub¢(E’), defined by pullback, has 
a left adjoint 4, (Proposition IV.6.3) and a right adjoint V_ (Proposi- 
tion IV.9.3), as in the following functors between posets: 


a7}: Subg(E) ——> Sube(E’) : Vasa, Fa 47 Ag. (18) 


For a subobject A < E the corresponding subobject 4,(A) of E’ is the 
image obtained by factoring the composite A> E — E’ as an epi fol- 
lowed by a mono. Since the inverse image functor f* of a geometric 
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morphism f: F — € preserves such epi-mono factorizations, it follows 
that this f* commutes with existential quantification, as in the commu- 


tative di * 
ative diagram Sub, (E) _ fe, Subs(f*E) 


| fpr (14) 


Sube(E") —— Subs (f*E’) 


for each arrow a: E’ — E in €. However, the corresponding diagram 
with existential quantification 4, replaced by universal quantification Vo 
does not commute in general. Indeed, for V2, there is just an inclusion. 
To see this, observe first that a subobject B < E pulled back along an 
atrow a: E’ — E in E gives the left-hand pullback below, which is then 
carried by f* to the pullback in F on the right: 


a~}(B) E! f*(a'B) f*E’ 


Pk em 


Bo E, tS ee ah 3) 


The second diagram implies that its upper left vertex is 
f*(a"B) = f*(a)"(f*B). (16) 


In particular, each subobject A < E’ determines a subobject V,A < E 
which [set B = V,A in (16)] satisfies 


(f*(@))"*(f*WaA)) = f(a Wa A) < f*A, 


the last inclusion by the counit a~!V, — 1 of the adjunction a7! 4 Vo. 
Since (f*(a))~* 4 V¢-(q), the transpose of this result is 


F°ValA) < V5-(ay(F*(A)); (17) 


this is the announced inclusion. Our next result shows that this inclusion 
is an equality exactly when the geometric morphism f is open. 

It will follow from this result that a geometric morphism is open 
exactly when it preserves the interpretation of first-order logic (in par- 
ticular, quantifiers). A precise statement will be given in §X.3. 


Theorem 3. Let f: F — E be a geometric morphism. If f is 
open, then its inverse image f* preserves universal quantification, in the 
a 
sense that for each diagram A E' — E in € the identity f*V_(A) = 
Vpe(a)(f*A) holds. And conversely, if E is a cocomplete topos and f* 
preserves universal quantification in this sense, then f is open. 
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Proof: (=>) The identity f*Vq(A) = Vy-(a)(f*A) for each A < E’ 
can be expressed by the commutativity of the diagram of posets 


Subs(f*E’) 22 sube(E’) 


vr] I (18) 


Suby(f*E) ——,— Sube(). 
E 


If f is open, the horizontal arrows in this diagram have left adjoints (fr): 
and (fz)1, and the diagram (18) commutes iff the corresponding diagram 
(12), obtained by taking left adjoints of all arrows in (18), commutes. 
But (12) commutes by the definition of open geometric morphism. 

(<=) Suppose f* preserves universal quantification, so that each dia- 
gram of the form (18) commutes. Then the corresponding diagram (12) 
of left adjoints commutes provided the left adjoints (fz), and (fz): ex- 
ist. To finish the proof of the theorem, we now show for any given object 
E in € that the map ff: Subg(£) — Sub;(f*E) of Heyting algebras 
has a left adjoint. 

First observe that since by assumption all colimits exist in €, the 
set Sube(E) of subobjects of E is now a complete Heyting algebra. The 
crucial observation is that if { A; |i € I} is any family of subobjects of 
E, their infimum A A; can be constructed from the universal quantifiers. 
For the subobjects A; together determine, by Corollary IV.7.6, a single 
subobject [] A; of []; Z. For the map a: [|]; E — E whose components 
are all the identity H — FE we claim that 


Va(] | Ai) = A\ A: (19) 


2E1 tel 


Indeed, if B < E is any subobject of E, then by adjunction B < 
Va([] Ai) iff a-1(B) < [] A;. But pulling back in a topos preserves co- 
products, so pullback of a along B— E gives a 1(B) = [], B< IL, E, 
so a~1(B) < J] A; iff B < A; for each index i. 

Since the inverse image functor f*: E — F is a left adjoint, and so 
always preserves coproducts, this formula (19) also shows that if f* pre- 
serves universal quantification, then f—: Subg(E) — Sub;(f*E) pre- 
serves infima. Therefore, we can propose as the desired left adjoint for 
ff the map 

(fe): Subs(f*E£) — Sube(E) 


of posets, which is defined for each subobject C < f*(E) by 


(fe) (C) = A{U < E|C< f*U)}. (20) 
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To show that this gives an adjunction we will construct the counit and 
the unit. For the counit, note that if C = f(A) for some A < E, the 
subobject A occurs as one of these U, so that (fz)1(ff#(A)) < A. And 
for the unit, we have for any C < f*(E) 


E(fe(C)) = A{ f*(U)|U < Band C < f*(U)}, 


since ff preserves infima as just shown; so clearly C < ff(fre):(C). The 
triangular identities then follow trivially, as for any poset; indeed, for 
any functors F', U of posets, 1 < UF and FU <1 imply F = FUF and 
dually UFU =U. This shows that (20) does define a left adjoint to fF, 
and thus completes the proof of the theorem. 

The definition of an open geometric morphism f: F — E can be 
formulated in terms of suitable maps to the subobject classifiers N¢ 
and Q¢ of the topoi € and F; these are objects which are inter- 
nally partially ordered (in fact they are internal Heyting algebras). 
The universal monomorphism tg = true: 1 — Qe¢ in € has an image 
f* (te): 1 = f*(1) — f* (Qe); by left-exactness of f*, this is a monomor- 
phism in ¥ and so has a classifying map +: f*N¢ — Os, as in the 
pullback diagram 

f*l 1 


ro} [> (21) 


f*(Qe) —,— Oe. 


A subobject A < E is classified in € by a map a as in the pullback 


A —— 1 


[| 


E—— ¢. 


The composite pullback (f* is left exact) 
fra > f*l >I] 


|r Ir 


PEGE Qe —,— Oe 


shows that f*(A) is classified in F by the composite map 7 0 f*(a). Now 
denote the transpose of the map 7 of (21) by 


A: Ne > (QF). (22) 
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Then the transpose of the composite 7 0 f*(a) is 10 a. Thus, composi- 
tion A, with \ corresponds (under classifying maps) to the action of f* 
on subobjects A. This fact is expressed by the following commutative 
diagram, for each object E of E: 


Subs(2) ==" ee 0) 


r| p (23) 


Subs (f*E) —~— F(f*E, Or) SE(E, f.QF). 
Here the top and left bottom horizontal isomorphisms are given by tak- 
ing the characteristic maps of the subobjects in question. Since f* pre- 
serves the partial order, so does 4,. Therefore as in (22) is a map of 
internal posets. 

Now suppose that f: F — € is open. For each object £ the left- 
hand vertical map f*: Sube(£) — Sub¢(f*£) in (23) then has a left 
adjoint (ff):: Subz(f*E) > Sube(£), natural in E. Via characteristic 
maps, (ff): corresponds as on the right of (23) to a morphism 

E(E, fae) = F(f*E, QF) — E(E, Qe). (24) 
By the naturality (12) of the (fz), this morphism (24) is again natural 
in E, so by the Yoneda lemma is given by composition with a uniquely 
determined map in €, 
pe: fxQ¢ Oe. (25) 
This definition of 4, according to which composition with ,: corresponds 
to (fz): via characteristic maps (here written as ~) means exactly that 
for each E € E the diagram 


Subs (f*E) —~— F(f"E, Me) =E(E, fr) 
ue I (26) 


Subg (£) ——>—— £(E, ¢) 
commutes. Moreover, the adjunction of posets between ff, and (fz): 


in terms of the inclusions (fg):(ff#(A)) < A and ff((fe):(B)) > B for 
A<Eand B < f*E translates via (23) and (26) into the inequalities 


por<id: Qe — Neg, 
Nop > id: f.(Qz) > f(r). 
They state that u is an internal left adjoint to the map X of internal 
posets (cf. §IV.9). 

Conversely, if the map 4: Qe — f,(Q¢) has such an internal left 
adjoint p: f.(QF) > Qe, then composition 44, with p determines maps 
(fz): as in (26) which are natural in E and which are left adjoint to fx. 
We have now proved 
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Proposition 4. A geometric morphism f: F — E is open iff the 
canonical map A: Ne > fe(Q¥) Of poset-objects in E [defined in (22) 
and (21)] has an internal left adjoint su: f,(Q¢) > Qe. 

Thus on both counts (Definition 2 and Proposition 4), “open” for a 


geometric morphism means the existence of suitable (external or inter- 
nal) left adjoints, 


7. Open Maps of Locales 

If a continuous map g: T — S of topological spaces is open, then 
each open set U of the domain T has an image g(U) = g:(U) which is 
open in S$ and which satisfies 


gu<V iffU<glV 


for all open sets V of the codomain S. Thus, just as in §6(1), if the 
posets O(T) and O(S) of open sets are considered as categories, gy is left 
adjoint to g~!. Moreover, this left adjoint g, evidently has the following 
additional property: 


g(U Ng *V) = g(U)NV (1) 


for all U € O(S) and V € O(T). This property (1) is often called the 
Frobenius identity or the projection formula (as in the case when g is 
the projection of a product, already considered in 81.9). 

This observation motivates the following definition of an open map 
between locales. 


Definition 1. A map f: Y — X of locales is open iff the corre- 
sponding map f~!: O(X) — O(Y) of posets has a left adjoint f: which 
satisfies the Frobenius identity 


A(VAfFTCU)=f/(V)AU_ for allU € O(X),VEO(Y). (2) 


This definition matches the notion of open geometric morphism, as 
defined in the previous section, in the following sense. 


Proposition 2. A map f: Y — X between locales is open iff the 
corresponding geometric morphism f: Sh(Y) — Sh(X) is open. 


Proof: (<=) If f is open, we will exhibit the required left adjoint pz 
of the canonical map A: Ognrx) > F.Osnv) of §6(22); here € and F of 
§6 are the topoi Sh(X) and Sh(Y). Now, just as for topological spaces, 
the subobject. classifier Nsp(x) for each U € O(X) is given by the set of 
all open “subsets” (sublocales) of U, 


Osn(x)(U) = OV) = {U'! € O(X) | U" CU}. (3) 
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Indeed, one proves readily that Qgn(x) so defined is a sheaf, while for 
any subsheaf S — F in Sh(X) one constructs the characteristic function 
F — Osgnxy exactly as for spaces (§I1.8). Moreover, the map f,: F > € 
of sheaf topoi is defined from the given map f of locales by composition 
with f—’. For the sheaf Ngp:y) this means by (3) that for each U € O(X) 


[fe(Qgncvy (UV) = Osneyy (F772) = O(f UV) 


ee (4) 
={VEOQY)|V<fU}. 


Now the map A: N¢ — Ff, in the present case is a map 


A: Osn(x) > F.Osnv) 
of sheaves on the locale X and so has for each U € O(X) a component 
du: Asn¢x)(U) > Asneyy (FU). (5) 
Also, by §6(23), composition (of characteristic functions of subobjects) 


with corresponds to the action of f* on subobjects. Since f* = f—}, 
this means for each U’ < U that 


du (U"') = f-'(U"). (6) 


Recall that we are assuming that the given geometric morphism f is 
open. By definition, there is, therefore, an internal left adjoint 


BH: Agneyy > Nsncx) (7) 


for the map 4 of internal posets. Hence, by evaluation at U, there is for 
each U an (external) left adjoint 


pu: Usnv)(f-"U) > Agncxy(U) (8) 


for Ay; moreover, this yy is natural in U € O(X) in the sense that, for 
each U’ C U and each V € O(Y), 


pu (V A f7*(U’)) = py (V)NU’. (9) 
Thus, (8) for U = X means that px is left adjoint to Ax = f7}; 
it remains to verify the Frobenius identity. But for V € O(Y) and 
U € O(X) we have 


ux(V A f-1(U)) < ux(f-*(U)) = HxAxU <U, (10) 
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so (9) gives 


ux(VAf"U))=px(VAF"(U))AU__ by (10), 
= py (V A f-*(U)) by (9), 
= px(V)AU by (9) and U CX. 
This is the Frobenius condition (2) for the left adjoint fr = px, so 
completes the implication. 
(=>) Conversely, suppose that the map f~!: O(X) — O(Y) does 


have a left adjoint f; satisfying the Frobenius condition. We may then 
define the desired jz in terms of its components for each U € O(X) 


Osnyy)(f-'U) > Agncxy(U), Le, py: O(f-*U) > OU), 
by setting for each V < f7!(U) 
bu (V) = fv. (11) 


This does give the inclusion py (V) < U because V < f~!(U) and so by 
adjunction f;V < U. To show that the yu so defined is natural in U, take 
U' <U in O(X) and V < f7!(U) in O(V) and calculate 


pu(V) AU! = fi(V) AU! by (11), 
= f(V A f7'U’) by Frobenius, 
= pu(V A f-'U’) by (11). 
As in (9) above, this means that uy is indeed natural in U. Moreover, 
since fy is defined by (10) in terms of fi and since f, is left adjoint 
to f—!, it follows that py is left adjoint to Ay for each U € O(X). 


Therefore, jz: is an internal left adjoint for 4. This completes the proof 
of Proposition 2. 


As for maps of topological spaces, open maps of locales can also 
be described by the condition that images of open sublocales are again 
open. Let f: Y — X be a map of locales, and let Y, be a sublocale of 
Y defined by a nucleus k: O(Y) — O(Y), as in §4. Recall from §4 that 
this locale Y;, is given by the frame consisting of k-fixed points, 


O(Y%,) ={U € O(Y) | kU =U}, 
while the corresponding embedding i: ¥; — Y is the frame-morphism 
i}: O(Y) — O(¥,), i(V)=kV_ [all V € O(Y)); 


its right adjoint i.: O(Y¥.) — O(Y) is simply the inclusion. 
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By Theorems 4.4 and 4.6 we can now factor the composition 
foi: ¥,;~+Y — X in a unique way as a surjection p followed by an 
embedding wu, as in 


Yoo Sy 


| | 
Xj;— > X; 


while, by the explicit description in §4, the sublocale X; is given by 
the nucleus j = (f 07),(f04)7) = fyisi 1 f-! = f,kf7!. For the 
given sublocale Y, of Y, this induced sublocale X; given by the nucleus 
j = fakf~} is called the image of Y;. One also writes 


X;=f(Ye),  G= Sek fo’. 


Recall from §4(10) that the sublocale Y, of Y is called open if its 
nucleus k is of the form k = (V = (—)), for some (necessarily unique) 
V € O(Y). In this case we also denote Y; by V. 


Proposition 3. A map f: Y — X of locales is open iff for each 
open sublocale V of Y its image f(V) is an open sublocale of X. 


Proof: (=) Suppose f is an open map of locales, and let V € O(Y). 
Thus V determines an open sublocale V = Y; — Y whose nucleus k is 
given by k(W) = (V => W), for all W € O(Y). The image-sublocale 
f(V) = f(Ye) of X is thus given by the nucleus f,kf~! on X. But for 
any U, U' € O(X), 


U' < fekf(U) = f.(V > FU) [since k = (V => —)] 
iff f'\(U’)<(V>f-'U)) (since f~" 4 fa) 
if VA f(U') <f7l(U) [by 1.8(1) defining =>] 
if f(VAS 7 (U')) <U (since fi 4 f~*) 
iff f\(V)AU'<U (by Frobenius) 
if U'<f(V)>U [by 1.8(1)]. 


Thus f.kf~'(U) = (f(V) => U) for all U € O(X), so the nucleus 
fakf7! on O(X) determines an open sublocale of X, namely, that cor- 
responding to fi(V) € O(X). 

(<=) Conversely, suppose that for each V € O(Y) the image f (V) 
is an open sublocale of X. By definition, this means that the nucleus 
fa(V > (—))f7? for this image f(V) — X is again of the form U > 
(—), for some U € O(X) depending on V. Write f,(V) for this U. We 
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claim that V +> fi(V) gives a left adjoint to f~! which satisfies the 
Frobenius identity, as required in the definition of open map. Indeed, 
for each V € O(Y) one has by the definition just above of fi(V) the 
following identity of nuclei on O(X): 


(fi(V) > (-))=foV > (-))of™. (12) 
Thus, for any U and U' € O(X), one has 


AV)AU <U if U'< fi(V) >U (§1.8(1) defining >] 

iff U' < f.(V => f-'(U)) [by (12)] 

if f\(U')<Vaf UU) (since f7" 4 fe) 

iff f-'(U') AV < f-'(U) [§1.8(1)]. 
If we now replace U’ by the top element 1 of O(X), we find that fi(V) < 
U iff V < f-)(U). Therefore f; is left adjoint to f~!. But then we can 
continue the sequence of iffs above one step further, as f-!(U’) AV < 
fT) iff A(f- (UAV) <U by adjointness. Thus, fi(V) AU’ < U iff 
A(f- (UAV) <U for all U, and hence f.(V) AU’ = fi(f71(U’) AV). 
This is the required Frobenius identity. 


Next, we compare and contrast subspaces and sublocales. Any subset 
R of a topological space T is itself a topological space with the usual 
relative topology. Moreover, this subspace R defines a nucleus jz on the 
locale O(T) of open sets of T and thus a sublocale of Loc(T) by 


jrn(U) =| {Ve OT) |VNRCU} (13) 
=(J{vV € oO) | (V-U)nR=9}; (14) 


in other words, jr(U) is the largest open subset V of T with VOR CU. 
However, it may be that different subsets R’ and R of T give rise in 
this way to the same nucleus. But we may replace R by the following, 
possibly larger, set R, intended to be the largest such with jr = jz on 


O(T). This set R is defined by 
ceR iff every locally closed subset, L in T which 
contains r has RN L £4 6. 
(Note that if the points of T are all closed, then R= R.) Here, by 
definition, a subset L of T is locally closed in T iff it is the intersection 
of a closed subset of T and an open subset of T, or (equivalently) iff it 


is the difference V — U of two open subsets of T. Therefore R is the 
largest subset of T such that, for all open U and V, 


RN(V-U)=0 => Rn(V-U)=6. (15) 
Then surely RC R so that the reverse implication in (15) also holds. 
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Proposition 4. In a topological space T, 


(i) for subsets R CR’ CT, jp=jn if R'C R, 
(ii) for each subset R, intersection gives an isomorphism of frames 


O(R) imi O(R), (16) 
(iii) if the space T is sober, so is the space R for every RCT. 


Proof: (i) First notice that by (14), for subsets R and R’ of T an 
inclusion R C R’ yields a reverse inclusion jg < jr [i.e., jr’ (U) © jr(U) 
for all U]. But by (14) and (15), R is the largest subset of T for which 
jr < jg. Thus jr < jr implies R’ C R, while conversely R’ C R 
implies Jq < jr’, so surely since jr < jz also jr < jr. 

(ii) By (15), RN(V —U) = Giff RN(V —U) =. But this amounts 
to the statement that RNV C RNUi®& RNV CRN U, which implies 
that intersection with R is an isomorphism O(R) — O(R), as required. 

(iii) Suppose that T is sober and R C T. To prove that Ris sober, 
let P be a proper prime element of the lattice O(R) of open subsets of 
R. By the definition 2.2 of sobriety, we have to show the existence of 
a unique point x € R such that P= R= {x}. Since P is open in R, 
it must be of the form P = KOR for some open subset K C T, and 
for this K we can take K = U{U ¢ T | U open, and UN RC P}. In 
other words, for any open U CT, 


UCK if UNRCP. (17) 


It follows from (17) that K is a proper prime element of O(T), since P 
is one in O(R). Since T is sober by assumption, there is a unique point 
x €T with K =T — {a}. It follows that P= KN R = (T- {x})NR= 
R- ({z}N R). Now {z} NR is the closure of the point x in the subspace 
R, provided x € R. So it remains to show that x € R. To this end, take 
any locally closed set E C T with x € L; say L = U—V where U, V 
are open in JT. Then z € U and x ¢ V, so since R272 (ay we lave 
U ¢ K and V C K; or by (17), UNRZ PandVORC P. Thus 
UNRZ VOR, and hence by Part (ii) of the lemma UN R Z VNR. 
Thus RN(U —V) #90. This holds for any locally closed set L = U—V 
containing x. So x € R by definition of R, as was to be shown. 

This completes the proof of Proposition 4. 

Finally, we can relate “open map” for spaces with “open map” for 
locales, as follows. 
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Proposition 5. Let f: T — S be a continuous map of topological 
spaces with S a7,-space. Then f is open iff the induced map of locales 
Loc(T’) > Loc(S) is open. 

Proof: By Proposition 3 above, the map Loc(f): Loc(T) — Loc(S) 
of locales is open iff for each open sublocale of Loc(T) its image in 
Loc({S) is open. Since O(Loc(T)) is simply the lattice of open subsets 
in the space 7’, an open sublocale of Loc(T) is given by the nucleus 


Raa) 
on O(T), for some open subset V C T. As before, the image-sublocale 


in Loc(S) is then given by the nucleus j = f,kf~!. Explicitly, for this 
V and each open U C S, 


j(U) = fakf~(U) 
= iw C S| W open, fw) < kf~(U)} (since fo 4 f.) 


=(J{W S| W open, f-'(W) NV < f-(U)} 
=|(J{W CS |W open, Wr f(V) CU} 


[the latter equality holds since f-'(W)NV < f-(U) iff WNF(V) <U 
by a point-set calculation]. In other words, j is exactly the nucleus j fV) 
as in (13) above, corresponding to the point-set image f(V) C S. 

Now if f is an open map of spaces, then f(V) is an open subset 
of S and the nucleus j = jscv) = (f(V) = (—)) evidently determines 
an open sublocale of Loc(S). So by Proposition 3, Loc(f): Loc(T) — 
Loc(S) is an open map of locales. Conversely, if the latter map of locales 
is open, then for each open subset V C T this nucleus j = jy), which 
describes the image of V as a sublocale of Loc(S), must be of the form 
(A => (-)) =ja: O(S) — O(S) for some open subset A C S. Thus, the 
subsets A and f(V) determine the same nucleus, and hence f(V) = A 
by part (i) of the preceding proposition. But S is a T,-space, so the 
points of S are closed, and hence R = R for any subset R C S as is 
evident from the definition of the operation R > R. Therefore, from 
fV) = A we conclude that f(V) = A; ie., f(V) is open since A was 
open. This shows that f is an open map of spaces, and so completes the 
proof of the proposition. 


8. Open Maps and Sites 

Given two sites (D, K) and (C, J), each functor 7: D > C with a 
suitable property (the clp) induces a geometric morphism on the corre- 
sponding sheaf categories 


f: Sh(D, K) — Sh(C, J). (1) 
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This section will show that the added condition “x preserves covers” 
will imply that this induced morphism f is open. The result will have 
an immediate application in the construction of the next section of the 
Diaconescu cover. 

First recall from §VII.10 that *: D — C has the covering lifting 
property (clp) for the given topologies iff, whenever a J-sieve S € J(xD) 
covers the image 7D of an object D of D, there is a covering K-sieve 
T € K(D) with xT C S. By Theorem VII.10.5 the geometric morphism 
f then exists and the inverse image functor f* for (1) is described, for 
any sheaf E on C, by the formula 


F*(E) = a(E on); (2) 


here a(E o 7) is the associated K-sheaf of the presheaf E oz on D. 

Recall also from §VII.10 that the functor 7: D — C is said to pre- 
serve covers when for any covering sieve T € K(D) in D the sieve (xT) 
in C generated by its image { 7(g) | 9 € T} is a J-cover of a(D) in C. 
The intended condition for f to be open now reads as follows 


Proposition 1. Let f: Sh(D, K) — Sh(C, J) be a geometric mor- 
phism induced as in (1) by a functor 7: D — C with the clp. If 7 
preserves covers, as above, and if for each object D € D the induced 
functor 7/D: D/D — C/xD on the slice categories is surjective on 
objects, then f is an open geometric morphism. 


The proof starts with a J-sheaf E on C, the composite presheaf 
Eox: D — CP — Sets on D, its associated sheaf f*E = a(£ 0 7) 
as in (2), and the lattices 


SubSh(f*£) and SubPr(£ oz) 


of subsheaves and subpresheaves, respectively. It is helpful to observe 
that the first lattice can be described without using the sheafification 
a, but directly in terms of the presheaf E o 7 and the closure opera- 
tion defined on presheaves by the given Grothendieck topology. Indeed, 
Corollary V.3.8 describes an isomorphism, induced by sheafification, 


ClSubpr(E 0 7) & SubSh(f*E) = SubSh(a(E 0 7), (3) 


between the lattice of closed subpresheaves of E o 7 and that of sub- 
sheaves of f* E. 

We need a characterization of closure in terms of covering sieves. In 
general, if P is a presheaf on a category D, each element d € P(D) and 
each subpresheaf A C P together determine a sieve Sg4 on D by 


Saa={g9: D) > D|d-g€ A(D’)}, dé P(D), (4) 
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consisting of those g which “pull” d into A. Moreover, by (6) in §V.4, 
AC P is aclosed subpresheaf iff for all objects D of D and all elements 
de P(D), 

Sa,a covers D implies d € A(D). (5) 


Lemma 2. Under the assumptions of Proposition 1, with E a sheaf 
on C, the induced map fy: SubSh(£) — SubSh(f*£) is given by com- 
Position with m followed by the isomorphism (3): 


Q B SubSh(E 

7 

' 

! / + 6) 
a 

Q Bor ClSubpr(£ ox) ———=>——> SubSh(f 


(3) 

At the left of this diagram, B+ Bow and Qh Q displays an 
adjunction still to be constructed. 

Proof: Consider any subsheaf B C FE in Sh(C, J). As in (2) above, 
its image f*(B) = a(Bo 7) under ff is the associated sheaf of the 
presheaf Box on D. Since the isomorphism (3) from closed presheaves 
is given by the sheafification operator a, it will suffice to show that the 
subpresheaf Boa of Ho7 is closed in Eo. To this end, take any 
object D of D and consider an element d € (HE 07)(D) for which the 
sieve SgBox of D defined as in (4) is a cover of D. Since the functor 
x: D — C is assumed to preserve covers, this implies that the sieve on 
m(D) generated by 7(Sq Box) covers 7(D) in C. But by (4) the arrows 
nag: TD! — ©D for g € Sa,Bor are those for which d-g € B(rD’), so 
they are among the arrows h in the sieve 


Sap ={h: C’ +>xrD=C|d-he B(C’)} 


on 7D. Thus, the sieve Sg gz contains the given covering sieve 7(Sa,Bor), 
so it must itself be a covering sieve on C'= 7D. But B C E is not only 
a subpresheaf, but is assumed to be a subsheaf, so by Lemma JII.7.2 it 
is closed in E. Therefore, d € B(xD). This shows that Boz is a closed 
subpresheaf of Eo 7, as required for (6). 

Proof of Proposition 1: According to the Definition 6.2 of an 
open geometric morphism, we have to construct for each J-sheaf EF on 
C amap (fz): of posets which is natural in E and left adjoint to the 
map 

fe: SubSh(£) — SubSh(f* £) 
displayed at the right of diagram (6). By the lemma, this map can be 
identified with the map 


SubSh(£) > ClSubpr(£ oz), BCEwBorcEon. (7) 


8. Open Maps and Sites 509 


To construct an adjoint to this map, consider a closed subpresheaf Q C 
Eon, Given an object C, each element e € E(C) determines a sieve 
Te,q on C' by 


T. = Te,g is generated by those h: r(D) > C with e-heQ(D) (8) 


[since Q C Eon, we have Q(D) C E(xD)|. Then define Q(C) c E(C) 
by 
e€Q(C) iff T.,g covers C,e € E(C). (9) 


We intend to define the desired adjoint (fz):(Q) to be Q. But first 
observe that @ is a subpresheaf of E. Indeed, if e € Q(Cc) so that T..9 
covers C as in (9), while u: C’ — C is any arrow in C, the stability 
axiom for the Grothendieck topology J on C states that the pullback 
u*(Te,g) of Te,g along u is a cover of C’. By definition, this pullback 
consists of those arrows k: C” — C’ for which u ok € T, 9; that is, of 
those arrows k for which [according to the definition (8) of T,| there is 
a commutative square 


Cc" ——+1D 


Care 

in C with e-h € Q(D). The assumption that the slice functor 
t/D: D/D — C/xD is surjective on objects then implies that there 
is an arrow w: D’ > D in D with x(w) = v and hence with 7D’ = C”. 
By the definition of T, one then has e-u-k = (e-h)-v € Q(D’). There- 
fore each k € u*(T,) is in T,,,, in other words, u*(Z.) C Te. Hence Teun, 
like u*T., is a cover of C’. By the definition (9) of Q this means that 
eu € Q(C’) for all u; in other words, Q is indeed a subpresheaf of E. 

To see that this subpresheaf Q is closed in E, we consider for each 
C and each e € E(C) the sieve 5.5 on C defined as in (4) by 


Sg ={u: 0! > C with e-we Q(C’)}. 
By the definition (9) of Q this means that S is 
5.6 ={u:C’ 3 C|T.., covers C’}. (10) 
As in the description (5) of closed subpresheaves, we wish to prove that 


5.5 covers C implies e € Q(C). (11) 
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But T..., is defined as the sieve generated by all those arrows h: 7D — C’ 
with (eu)h € Q(D), while for these arrows u, (eu)h = e(uh), so uh € Te 
and thus h € u*T., which means that T., C u*T.. Now if Ss. @ covers 
C' as in (11) then, for every u € 5.o the sieve T,,, covers C’ and 
hence the pullback u*T, also covers C’. The transitivity axiom for a 
Grothendieck topology then states that T, covers C’. Therefore, by (9), 
e€ Q(C). Thus (11) holds, so that Q is a closed subpresheaf of E. 
By Lemma III.7.2 it is, therefore, a subsheaf of E, so that we have 
constructed the desired functor Q > Q 


(fz): ClSubpr(Eo7) > SubSh(E), (fz)Q=Q. (12) 


Finally, we verify that the functor (fz): so constructed is indeed a 
left adjoint to the functor ff, as the latter was identified in (7) with the 
functor Bw Bon for BC E. First, we show for each such subsheaf B 
of the sheaf EF that, under the map Q+ Q, 


(Bon) =(fz)\(Bom) CB. (13) 


Indeed, if e € E(C) is such that e € (Bo7)(C) then, by definition (9) 
of Q, the object C is covered by 


Te,Bor ={h:tD > Cle-he B(xD)}, 
a sieve on C’. But this sieve is contained in the following sieve on C 
Sep={g:C'’ >Cle-g€ B(C’)}; 


so, by the properties of coverings, this latter sieve also covers C’. Since 
B is a subsheaf—that is, a closed subpresheaf—of E, the criterion (5) 
above for closure shows that e € B(C) and so proves (13). 

Second, take Q C Eom, any closed subpresheaf of Fox. Then for any 
D ED with C= 7D and any e € Q(D) C E(xD) = E(C) the identity 
arrow 1p belongs to T. g, a sieve on 7D. Therefore Tg, as the maximal 
sieve, must cover 7D = C. Hence, by (9), e € Q(C) = [(fe)1(Q)|(C). 
Thus s 

e € Q(C) = [(fe)(Q)|(C) = ([(Fz):(Q)] 0 7) (P). 


This holds for all e € Q(D), so 
Q © [(fz):Q| on. (14) 


But f# has been identified in (7) with — o7. Thus, the first and 
second arguments above together yield the counit (13) and the unit (14) 
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for the desired adjunction (fz); 4 f#; the triangular identities for unit 
and counit follow formally, as for any poset. 

To complete the proof, one must observe that the functor (fr); of 
(12) is natural in E, in the sense that each arrow a: E’ — E in Sh(C, J) 
makes the diagram (12) of §6 commute. But by the isomorphism (3) we 
may equivalently write this diagram as 


ClSubpr(E 0 7) “2, SubSh(Z) 


(oor [a 


ClSubpr(E’ o 7) ae SubSh(£’), 
Bt ' 
and it follows from the explicit description (9) and (12) of (fg): that 
this diagram indeed does commute. 


Remark 3. If in the situation of Proposition 1 we also assume that 
m is surjective on objects, then clearly the functor (7) of lattices is injec- 
tive. Since by Lemma 2 this functor is, up to isomorphism, the functor 
fie: Sub(£) — Sub(f*£) of subobject lattices, it follows by Condi- 
tion (iv) of Lemma VII.4.3 that in this case the geometric morphism 
f: Sh(D, K) — Sh(C, J) of Proposition 1 is an open surjection. 


9. The Diaconescu Cover and Barr’s Theorem 
The main result of this section is the following 


Theorem 1. For every Grothendieck topos € there exists a locale 
X and an open surjective geometric morphism 


Sh(X) — €. (1) 


The locale X to be constructed in the proof is called the Diaconescu 
cover of E. 


Proof: The given Grothendieck topos € is equivalent to the category 
Sh(C, J) of sheaves on some site (C, J). Now if we had a site (S, K) 
formed from a category S which is a poset, its sheaves by Theorem 5.1 
would form a localic topos. A suitable functor 7: S — C then might 
give an open geometric morphism in accord with Proposition 8.1 just 
above. Hence we search for a suitable poset S with a map 7 onto the 
given category C. 

We take this poset S to consist of strings of arrows of C, where a 
string is to be a sequence 


2 (Cy Oh 5.) (2) 
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of composable arrows in C, These strings carry a natural partial order: 
for two strings s and t we write t < s if t prolongs s to the left. [So for s 


as in (2), t must have the form Chim — ... 7 Ch pala eae Co.] This 
defines a poset S = String(C) of strings in C. This poset is a category 
in the usual way, with an evident projection functor 


a: String(C) — C 


given on the objects s of (2) by x(s) = C;,; on inclusions t < s, 7 is 
defined as the evident composition. 

We now equip the poset String(C) with a Grothendieck topology K, 
by defining a sieve U on an element s € String(C) to be a K-cover of s 
iff for any t < s in String(C), the set of arrows x(t’ < t): a(t’) > (t) 
where t’ € U form a J-cover of z(t) in C. [Here we have identified the 
sieve U on s with a downward closed subset of { t’/ € String(C) | t! < s}.] 

This does indeed define a Grothendieck topology on String(C) (the 
transitivity axiom follows from the transitivity axiom for the topol- 
ogy on C, and the stability axiom for K is “built into its defini- 
tion”). By Theorem 5.1, the resulting topos Sh(String(C), A’) is localic. 
Thus to prove the present theorem, it suffices to show that the functor 
mw: String(C) — C satisfies the conditions of Proposition 8.1 and Re- 
mark 8.3. It is evident that the functor 7 preserves covers and that, for 
each string s, the induced functor String(C)/s — C/7(s) is surjective 
on objects. 


To see that 7 has the clp, consider a string s = (C,, se Co) 
as in (2), and let R be a covering sieve on its image C, = x(s) in C. 
Define a sieve U on s by 


U={t'|t'<s, r(t’<s)eR} 
= {(Qnim)---,Qn,+--,@1) € String(C) | Qn41 O-+-OAnim € Rf. 
Then xU C R, so it will be enough to show that U is a K-cover of s. To 


An+m 


this end, take any t < s, say t = (Cham ———>... > Ch par 1. oC) 
and write g = a(t < 8) = Qn410°++ 9Qn4m. Let R’ = g*(R). Then, 
by the axiom of stability for J, R’ is a J-cover of Cnim in C. Also 
n(UUNn{t’ | t <t}) contains R’, so is a J-cover. Thus, by the definition 
of K, this U is a K-cover of s. This shows that the functor 7 has the 
clp. 

Proposition 8.1 now implies that 7 induces an open geometric mor- 
phism 

Sh(String(C), A) — Sh(C, J). 

Since a is also surjective on objects, Remark 8.3 shows that this ge- 


ometric morphism is also surjective. This completes the proof of the 
theorem. 
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Theorem 2 (Barr’s Theorem). For every Grothendieck topos € 
there exists a complete Boolean algebra B and a surjective geometric 
morphism Sh(B) — €. 


Recall that Sh(B) is the topos of sheaves on the Boolean algebra B 
with the usual sup topology [as defined for any cHa in Example III.2(d)]. 
The complete Boolean algebra B is a frame, and Sh(B) is the topos of 
sheaves on the corresponding locale |i.e., the unique locale Y specified 
by O(Y) = B] as defined in §5 above. 

Barr’s theorem follows from Theorem 1 and the following result 
about, locales. 


Lemma 3. For every locale X there exists a surjection Y — X of 
locales for which O(Y) is a complete Boolean algebra. 


Proof: Recall from §4(11) that if X is a locale, we may for each 
U € O(X) define a closed sublocale X — U of X, with O(X —U) & 
{V €O(X)|V >U}. Also, the frame of =—-fixed points of O(X —U) 
gives a “double negation” sublocale (X — U)_.., as in §4(14). Now set 


Y= |] (*-U)... 


UEO(X) 


Here “[[” is the coproduct in the category of locales. This coproduct 
is constructed as the product in the opposite category of frames. [So 
O(Y) is the product of the frames O(X — U)_.., with operations of 
supremum and infimum taken pointwise.] There is a canonical map of 
locales p: Y — X, defined on each summand (X —U)__, as the composite 
embedding py : (X —U).. -(X —U)—X. This map p is a surjection 
of locales; i.e., p71: O(X) — O(Y) is an injective frame map. For if U, 
V € O(X) and U < V whileU # V, then p;;'(U) = 0 but p'(V) 4 Oin 
O(X —U)__.. Furthermore, since each frame O(X —U)__, is a complete 
Boolean algebra by §4(14), so is their product O(Y). This proves the 
lemma. 


Barr’s theorem is useful for cohomology. In the study of sheaf coho- 
mology of a topological space X one employs the Godement resolution 
in the category of sheaves (of modules) by finding a surjective geometric 
morphism Sh(Y)-» Sh(X) so that epimorphisms e split in Sh(Y) [ie., 
for each epimorphic e there is s with es = 1]. Such a space Y is easily 
found by taking the points of X with the discrete topology. The original 
purpose of Barr’s theorem was to generalize this Godement resolution 
to the context of the cohomology of an arbitrary Grothendieck topos € 
by constructing another topos B in which epimorphisms split and with 
a surjection B— €. For this B, recall that Proposition VI.1.8 shows that 
epis split on the topos of sheaves on any complete Boolean algebra. 
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10. The Stone Space of a Complete Boolean Algebra 


We begin this section by reviewing the well-known construction of 
the Stone space of a Boolean algebra. This construction, discovered by 
Marshall Stone in 1936, shows that every abstract Boolean algebra is 
isomorphic to a suitable algebra of sets. His representation was explic- 
itly topological, by the closed-and-open subsets of a compact. Hausdorff 
space, now called the Stone space of the Boolean algebra. Stone’s origi- 
nal construction was formulated by regarding the Boolean algebra as a 
special type of ring (a Boolean ring), and then using the space of prime 
ideals (now called the spectrum) of this ring. In our presentation, we 
replace the prime ideals by the (essentially equivalent) maximal filters. 

Let B be a Boolean algebra; as usual, we shall write < for the par- 
tial order on the elements of B, 1 and 0 for the largest and the smallest 
element, and A, V, — for the operations of forming the infimum and 
supremum (of two elements) and the complement (of an element). Re- 
call that a subset, F C B is called a filter if it has the following three 
properties: 


leF, 0¢F, (1) 
ifa<banda €F, then be F, (2) 
ifaeéFandbe F, then aAdbe F. 


These filters are partially ordered by inclusion. A filter m is maximal if 
it is contained in no other filter; that is, for any filter F' such that m C F 
one has m = F. We will use Zorn’s lemma to prove that every filter is 
contained in a maximal one. 


Lemma 1. For a filter m in a Boolean algebra B, the following 
conditions are equivalent: 


(i) m is maximal; 
(ii) for each 6 € B, either b € m or 7b € m; 
(iii) for any a, b CB, ifavb Em then a Emordb em. 


Proof: The implication (iii)=(ii) is clear, since b V =b = 1. For 
(ii)=>(i), suppose that m is a filter satisfying condition (ii). If F D m is 
a filter which properly contains m, then there exists a b € F with b ¢ m. 
Hence by condition (ii) for m, we have —b € m. But m C F, so also 
ab € F, and hence 0 = bA-0) € F since F is a filter. This contradicts 
condition (1) for filters. Finally, to prove (i)=>(iii), suppose that a,b ¢ B 
are two elements such that a V } belongs to a maximal filter m, while a 
does not belong to m. We will show that 6 € m. Construct from a the 


set 
F={y¢eB|izem(zAaK<y)}. 
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One readily checks that F > m, and that F satisfies all conditions (1)- 
(3) for a filter except perhaps the condition that 0 ¢ F. Buta ¢ F 
while a ¢ m, so F cannot be a filter by maximality of m. Thus 0 € F, so 
there is some z € m with x \a=0. But then 6 > 2Ab=(xAb)VO= 
(x Ab) V(t#Aa) =xA(aVb), 506 Em since both g €mandavb em. 
The proof is complete. 


For an element b € B, define a set D(b) of maximal filters by 
meD(b) iffb> em. (4) 
Lemma 2. For any elements a, b € B: 


(i) D(a) N D(b) = D(a Nd); 
(ii) D(a) U D(b) = D(a d); 
(iii) D(a) C D(b) iffa <b. 


Proof: By conditions (2) and (3) for any filter F, one has aAb € F 
iff a € F and b € F, so (i) is clear. Next, part (ii) of the lemma 
follows directly from part (iii) of the previous lemma. It thus remains 
to prove (iii). Clearly a < b implies that D(a) € D(b), by condition (2) 
on filters. For the converse, suppose that D(a) C D(b), while a < b. 
Then a A-b # 0, so the set F = {x2 € B | aA-7b < x} is a filter 
in B. By Zorn’s lemma, F is contained in some maximal filter m. In 
particular a Ab € m; whence @ € m since (a A-b) < a, and b ¢ m since 
bA(aA-b) =0. This contradicts D(a) C D(d). 

It follows from part (i) of this lemma that the sets D(a), for all a € B, 
form a basis for a topology on the set of all maximal filters in B. The 
resulting topological space is called the Stone space of B, in symbols, 
Stone(B). So the points of Stone(B) are the maximal filters in B, and 
a subset U C Stone(B) is open iff for each maximal filter m € U, there 
exists a 6 € B with m € D(b) CU. It is well-known and not difficult to 
prove that Stone(B) is a compact Hausdorff space, and that the compact 
open subsets of Stone(B) are exactly the basic open sets D(b). Thus, 
the Boolean algebra B is isomorphic to the lattice of compact open 
subsets (equivalently, those subsets which are both closed and open) 
of the space Stone(B). This gives the so-called “Stone-duality”. (For 
details, see, e.g., [Halmos] or [Johnstone, 1982].) Of course, the Stone 
space can equivalently be defined as the space of prime ideals p in the 
Boolean algebra B, with basic open sets of the form {p | b ¢ p}, for 
each 8 € B. Indeed, each maximal filter m determines a prime ideal 
p = {b| -b em}, and conversely. 

Here we will be interested in a somewhat different aspect of the Stone 
space, for the special case where B is a complete Boolean algebra. In 
particular, B is then a frame (§1), and one can define a canonical map 


¢: O(Stone(B)) > B (5) 
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from the frame of open subsets of the Stone space back to the complete 
Boolean algebra B, by setting 


g(U) = \/{b eB | Dis) CU} (6) 


for each open U C Stone(B). 


Proposition 3. For every complete Boolean algebra B, the map 
@: O(Stone(B)) — B defined by (6) is a surjective homomorphism of 
frames. 


Proof: By part (iii) of Lemma 2, one has 


g(D(b)) =6 (7) 


for any element b € B, so ¢ is surely surjective. We have to show 
that ¢ preserves finite meets and arbitrary sups. Clearly ¢ preserves 
the top element, i.e., the empty meet. For binary meets, consider open 
subsets U and V of the Stone space. Clearly, (UNV) < ¢(U) and 
@(UNV) < AV), so (UNV) < dU) A@(V). For the converse we have 


gU) A dV) =\/{a €B| D(a) CU}A\V/{b €B| Db) CV} 
=\/{aAb| D(a) CU and D(b) CV}, 


where the latter identity follows from the infinite distributive law [§1(1) |. 
But for any such a and 6 with D(a) C U and D(b) C V, also D(a Ab) = 
D(a)ND(b) C UNV, so aNb < d(UNV). Thus, 6(U)Ad(V) < d(UNV). 
This shows that ¢ preserves binary meets. 

For suprema, consider a family {U; | i € I} of open subsets of the 
Stone space. Clearly, V,-; @(Ui) < (Uj-; Ui). To prove the converse 
inequality, consider any element b € B such that D(b) C Uie; Ui. We 
will show that 6 < \/,-,¢(U;). Since the subsets of the form D(a), 
a € B form a basis for the topology, D(b) C UU; implies that D(b) C 
U, D(ae), where { a¢ } is a collection of elements from B such that, for 
each €, the set D(ag) is contained in some U;. It now suffices to show 
that 6 < \/, a¢, for then also b < V;-, 6(Ui) since each ag < (Uj) for 
some i. Suppose to the contrary that 6 £ \/, ag. Then bA(V/, ag) #0, 
so just as in the proof of Lemma 2(iii), Zorn’s lemma gives a maximal 
filter m with 6A =(V,a¢) Em. Sob Em but Vag ¢ m. But for any 
particular ), we have ag, < Ve a, hence also ag, ¢ m; i.e., m ¢ D(ag,). 
Since this holds for any &, we find that m ¢ ) D(a¢); this contradicts 
D(b) C Uz D(a¢), and the proof is complete. 

Now compare the topos Sh(Stone(B)) of sheaves on the Stone space 
with the topos Sh(B) of sheaves on the given complete Boolean algebra 
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B. The surjection (5) of frames ¢: O(Stone(B)) — B can be viewed in 
the opposite direction as an embedding of locales (Definition 4.1), and 
hence induces an embedding 


i: Sh(B) ——— Sh(Stone(B)) (8) 


of topoi, as in Proposition 5.5(ii). As explained in §5, this geometric 
morphism 7 is induced by the left-exact and continuous functor obtained 
by composing ¢ with the Yoneda embedding 


O(Stone(B)) —* B —*—  Sh(B). 


By the explicit description given in Theorem VII.10.2, the direct image 
functor 
i,: Sh(B) ——— Sh(Stone(B)) 


is given by composition with ¢; so, for a sheaf F on B and any open 
subset U of the Stone space, 


i.(F)(U) = FU). (9) 


Furthermore, the inverse image functor i*: Sh(Stone(B)) — Sh(B) fits 
into a commutative square 


O(Stone(B)) ———— Sh(Stone(B)) 


| Ir (10) 


B——____> Sh(B). 


[This is the same diagram as (8) in §5; the horizontal inclusions are the 
Yoneda embeddings; or alternatively, they come from identifying B and 
O(Stone(B)) with the lattices of subobjects of the terminal object 1 in 
the topoi Sh(B) and Sh(Stone(B)), respectively. ] 

The embedding i: Sh(B) — Sh(Stone(B)) of (8) has the following 
special property, which will play a crucial role in our proof in the next 
section of Deligne’s theorem. 


Lemma 4. The direct image functor i,: Sh(B) — Sh(Stone(B)) of 
the embedding (8) preserves finite epimorphic families. 


Proof: We are to consider a finite epimorphic family 
{an | Fe > F yeaa (11) 


of sheaves Fi, and F in Sh(B). Now it follows from Corollary III.7.6 (in 
exactly the same way as Corollary III.7.7 does) that this family (11) is 
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locally surjective, in the expected sense: Given any element b € B [an 
object in the site for Sh(B)] together with an element x € F(b), there 
is a (possibly infinite) cover b = \/,-; b; of 6 and for each index i € I of 
the cover some index k; € {1,...,n} together with an element v, such 
that 

vi © Fy,(b;) = and —s ag, (uj) = aly. 


Now the given family (11) is finite, and we will first show for each 
such « € F(b) that there is a finite list c),...,c, of elements of B, 
together with elements z, € Fi,(c,) fork =1,...,n, such that 


b= V-+- Ven and On (Ze) = 2|cp (k=1,...,n). (12) 
First, we define cz as 
ce = \/{ 0; |i € I such that k, =k} (13) 


so that the first equation of (12) does hold. Next, introduce for each 
index k the following pullback P, in Sh(B): 


P,, ———— F, 


al |e (14) 


y(cx) ———> F.. 


Here the bottom arrow y(c,) — F is the one corresponding by the 
Yoneda lemma to the element zc, € F'(cx). This makes the sheaf P, 
explicit: for each element a € B one has P,(a) = @ if a £ cz, and 


P,(a) = {vu | v € Fy(a) and a,(v) = cla} (15) 


if a < cx. 

Next, the left-hand map 6: P, — y(ck) in (14) is locally surjective. 
Indeed, take any element of y(c,) = Hom(-— ,cx); that is, an element 
a<cx in B. According to (13) this element of B has a cover 


a=aNcy=\/{aAb,|i€ I such that k, = k}. 


We wish to show that the element (2 Ac, — cx) is in the image of 
the map ( in (14). But for each index i with k, = k we are given an 
element v, € Fy, (bi) for which a4 (v;) = 2|b,; its restriction to a Ab, thus 
is an element vj = v;|(@ A d;) with ag(vj) = z|(a Ad;). Hence by (15) 
vu, € P,(a) is an element mapped by (; to (a A ch > cx) in y(c,). This 
proves that (, is indeed locally surjective, hence is an epimorphism in 


11. Deligne’s Theorem 519 


Sh(B) [cf. Corollary III.7.6]. But by Proposition VI.1.8, epimorphisms 
split in Sh(B), so there is a section og: y(ck) — Px of By. When we 
apply this section to the identity element (c, — cx) of y(cx) we obtain 
an element z,% € Fi(cx) with o4(z,) = z|cx, as desired for (12). 

We can now conclude the proof of the lemma by showing that 


{tx(k) | te(Fk) > i(F) fem 


is an epimorphism of sheaves on the Stone space of B. According to 
Proposition II.6.6 describing epimorphisms in sheaves it will suffice to 
show for each point (maximal filter) m of Stone(B) that the associated 
maps of stalks 


(t«(@z))m? (é4(Fe))m > (ée(F))m (R= 1,...50) (16) 


constitute an epimorphic family in Sets. To this end, pick an element in 
the image stalk (7.(/))m; it must have the form germ,,(x) for some x € 
i,(F)(D(b)), where D(b) for some 6 € B is a basic open neighborhood 
of m. But i,(F)D(6) = F(b) by the definition (9) of i,(F) and by the 
identity (7). Hence, we have x € F(b). Therefore by (12) there are 
elements c;,...,;Cn € B and for each k = 1,...,n elements z, € F(cx) 
with a,(zh) = z|ce. Then D(b) = D(cy) U...U D(c,) by Lemma 2(ii), 
so since m € D(b) we must have m € D(c,) for at least one index k. 
Then for this k, 


germ,, (x) = germ,, (z|c,) = germ, (ax (Zx)) 


is in the image of the corresponding k** map i,(F,) > i.(F). Thus, at 
each point m of the Stone space of B, the n maps of stalks in (16) are 
jointly surjective. This completes the proof of the lemma. 
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A coherent topos is a topos for which there is a site (C, K) where C 
is a category with finite limits and the Grothendieck topology is given 
by a basis K (Def. III.2.2) which consists of finite covering families. Let 
us call such a site “of finite type”. Many of the topoi arising in algebraic 
geometry are coherent; for example the Zariski topos is coherent since 
the site given for it in §III.3 is evidently of finite type. Also, the classify- 
ing topos of any geometric theory is coherent, since the “syntactic site” 
to be constructed for any such theory in §X.5 is of finite type. (Observe, 
however, that even if a topos € is coherent, there are many sites for € 
which are not of finite type, as will be apparent from Giraud’s theorem; 
cf. in particular Corollary 4.1 of the Appendix.) 
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Theorem 1. Let B be a complete Boolean algebra, with cor- 
responding embedding i: Sh(B)—Sh(Stone(B)) as in §10(8) above. 
Then any geometric morphism from Sh(B) into a coherent topos can 
be extended to Sh(Stone(B)). 


This theorem asserts that for any geometric morphism f: Sh(B) > 
€, where € is coherent, there exists a geometric morphism g such that 
the following diagram commutes, up to natural isomorphism: 


sh(B)=—2—+€ 
an 
| “9 (1) 


a 


Sh(Stone(B)). 


Proof: Let € be a coherent topos, so that € = Sh(C, K) where 
C is a category with finite limits and K is a basis for a Grothendieck 
topology for which all covering families are finite. Let f: Sh(B) — € be 
any geometric morphism. As in Corollary VII.9.4, f corresponds to a 
left exact continuous functor A: C — Sh(B) for which there is a natural 
isomorphism 
§.(G@)(C) = Hom(A(C), G) (2) 
for each sheaf G and each object Cin C. Consider the composite functor 
A’ =i, 0 A: C > Sh(Stone(B)). Clearly A’ is left exact since both i, 
and A are. Moreover since C is of finite type and i, preserves finite 
epimorphic families (Lemma 10.4), it follows that A’ is continuous since 
A is. Thus, again by the results of Chapter VII, the functor A’ defines 
a geometric morphism 


g: Sh(Stone(B)) > € = Sh(C, K), (3) 


with the inverse image g* given by tensoring with A’ while the direct 
image g, is given, for each sheaf F' on the Stone space and each object 
C € C, as the Hom-functor 


g«(F)(C) = Hom(A’(C), F). (4) 


In particular, if G is any sheaf on B and i,G the corresponding sheaf 
on Stone(B), there are natural isomorphisms 


(by (4)) 
( (since i* 4 i,) 

= Hom(i*i, A(C), G) (by definition of A’) 
( G 
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the latter since 7 is an embedding of topoi so that i*i, = id; cf. §VII.4. 
Hence by the description (2) above of f as a Hom-functor there is a 
natural isomorphism 9, 0i, = f,. This shows that the diagram (1) 
commutes, up to natural isomorphism, and so completes the proof of 
the theorem. 


Corollary 2. For any coherent topos €, there exists a complete 
Boolean algebra B and a surjective geometric morphism 


Sh(Stone(B)) > €. 


Proof: Given €, Barr’s Theorem 9.2 provides a_ surjection 
f: Sh(B) — € for a suitable complete Boolean algebra B. By the 
preceding theorem, this f can be extended as in (1) to a geometric 
morphism g: Sh(Stone(B)) — €. By the commutativity of (1), g is 
surjective since / is. 

Now recall that a point of a topos E is a geometric morphism Sets > 
E. A topos is said to have enough points if the collection of all points is 
“jointly surjective”; that is, for any two distinct arrows a, 3: ED in 
E, there is some point p: Sets — € such that p*(a) # p*(@). This can be 
expressed in different but equivalent ways, analogous to Lemma VII.4.3 
characterizing surjections. For instance, the topos € has enough points 
iff for any two subobjects A and B of a given object E in E, 


A<B iff p*(A) C p*(B) for any point p of €. (5) 


Clearly the topos Sh(T) of sheaves on any topological space T has 
enough points. [Indeed, if a, 8: E=D are distinct arrows in Sh(T), 
then for some actual point t € T the stalk maps a,, @,: E, — D, must be 
distinct; but these are the inverse images of a and @ under the geometric 
morphism Sets — Sh(T) given by the point t.] Also if f: F > Eisa 
surjective geometric morphism and ¥ has enough points, then so does 
€. From this remark and Corollary 2 we obtain: 


Corollary 3 (“Deligne’s theorem”). A coherent topos has 
enough points. 


With classifying topoi based on Gentzen’s rules as suggested at the 
end of §X.5, this corollary is essentially equivalent to Gédel’s Complete- 
ness Theorem for first-order logic. 


Exercises 


1. (a) For the lattice of open subsets of a topological space show 
that the infinite distributive law 
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2. 


3. 
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UvA\V=AUvM) 


[which is the dual of §1(1) ] need not hold. 

(b) Give examples to show that for a map f: S — T be- 
tween topological spaces, the homomorphism of frames 
f~!: O(T) — O(S) need not preserve infinite meets, nor 
implication or negation. Show that if f: S — T is any 
open map between topological spaces, then f—~!: O(T) — 
O(S) does preserve infinite meets and implication. 

(c) Fora map f: S — T between topological spaces, show that 
the map f,: O(S) — O(T) of §1(4a) (the right adjoint of 
f~!) can also be described by 


f.(U) =T- f(s -—V), U open in S. 


Give an example to show that this f, need not preserve sups. 

Give examples to show that not every T,-space is sober, nor is 
every sober space Jj. 
An atom in a complete Boolean algebra B is a nonzero element 
a € B such that for each b € B, if b < a, then either b = 0 or 
a=. Show that frame homomorphisms B — {0,1} correspond 
to atoms in B. [Thus every complete atomless Boolean algebra 
B defines a locale X without points by setting O(X) = B.] 


. A completely prime filter G on a frame A is a subset G of A such 


that (i) 1€ G and 0 ¢ G; (ii) aAb € G iff a € G and b € G; (iii) 
if Va, € G, then a, € G for some index i. 


(a) If T is a topological space, check that the set of all open 
neighborhoods of any point t € T is a completely prime 
filter on the frame O(T). 
For a locale X, show that the points of X correspond bijec- 
tively to the completely prime filters on the frame O(X). 
A topology on a set can be described by specifying 
only a basis for the open sets. In a similar way, one can 
define a locale by giving a presentation in terms of a poset 
equipped with a covering system, as in the following exer- 
cise. Examples are then provided by Exercises 6-8. 


(b 


— 


. Let P be a partially ordered set. A covering system on P is a 


function Cov which assigns to each p € P a family of subsets 
SC{qeEP|q<p}, the “covers of p”, in such a way that the 
following stability condition holds: 
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if S € Cov(p) and q < p then there exists a T € Cov(gq) 
with Vie Tas € S(t <s). 


(a) A sieve on P is a subset U C P such that q < pe U 
implies g € U; such a sieve is called closed if for any subset 
S CU, S € Cov(p) implies p € U. Show that for any 
given covering system Cov, the closed sieves on P form a 
frame, and describe explicitly the operations A, =, and = 
of supremum, implication, and negation in this frame. 

(b) For every p € P let B, be the smallest closed sieve con- 
taining p. Observe that these sieves B, form a “basis”, 
in the sense that every closed sieve S is a supremum of 
such basic sieves By. [If X is a locale such that O(X) is 
isomorphic to this frame of closed sieves, then (P, Cov) is 
called a presentation of X.] 

(c) Fora locale X, contemplate the relation between presenta- 
tions (P, Cov) of X and sites for the topos Sh(X). (Hint: 
use Giraud’s theorem from the Appendix.) 


6. Let P be the poset of finite sequences of zeros and ones, partially 
ordered by setting for u and v € P: 


u<v_ iff vis an initial segment of u. 


For u € P, let Cov(u) consist of a single family, namely the 
family {u~0,u~1 }, where ~ denotes concatenation. Prove that 
this defines a covering system, and that (P, Cov) thus defined is 
a presentation of the Cantor space 2N [or more precisely, of the 
locale Loc(2N) corresponding to the space 2]. 

7. Let P be the poset of rational intervals (p,q), where p < q, par- 
tially ordered by inclusion. For (p,g) € P, define Cov(p,q) to 
consist of two types of families: 


(i) {(p,7r), (8,49) } € Cov(p,q), whenever p<s<r< q 

(ii) { (Pas dn) | n € N} € Cov(p,q), whenever (p,,) is a de- 
scending sequence converging to p and (g,,) is an ascending 
sequence converging to q. 


Prove that this is a covering system on P, and that (P, Cov) 
thus defined is a presentation of the locale Loc(R) corresponding 
to the space of real numbers. 
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8. (“Killing points”) Let T be a topological space. Define a covering 
system Cov on the poset O(T) by setting, for any family of open 
subsets U; C U, 


{U,|ieI}eCov(V) iffU- (JU; is finite. 
ier 


(a) Check that this is indeed a covering system. 

(b) Write K(T) for the locale presented by (O(T), Cov), so 
that O(K(T)) is the frame of closed sieves for this covering 
system. Prove that O(K(T)) is isomorphic to the lattice 
of those open sets of T which have a perfect complement. 
(A closed set is called perfect if it has no isolated points.) 

(c) Show that K(T) is a sublocale of the locale Loc(T) corre- 
sponding to the space T. Show that K(T) is in fact the 
largest sublocale of Loc(T) which doesn’t have any points. 


9. A map f: Y — X between locales is called a local homeomor- 
phism, or an étale map, if there is a family {V; | i € I} of 
elements of O(Y) such that V/V; = 1 [the top element of O(Y)] 
and such that for each i there is a U; € O(X) such that f restricts 
to an isomorphism between open sublocales for each index i, as 
in 


Vi ---22-9 U; 


| | 


Show that any étale map is open, and that the composition of 
two étale maps is again étale. Also show that if f: Y — X and 
g: Z — Y are maps such that f and f 0g are étale, then g must 
also be étale. 


The purpose of the next three exercises is to prove an equivalence of 
categories between étale bundles and sheaves in the context of locales, 
analogous to the case of spaces discussed in Chapter II. 


10. Let f: Y ~ X bea map of locales. For U € O(X), let S;(U) be 
the set of sections of f; that is, if we view U as an open sublocale 
of X with embedding i: U— X, then S;(U) is the set of maps 
3: U > Y with fos =i. Prove that S> is a sheaf on the locale 
X. 

11. Let F: O(X)°P -> Sets be a sheaf on a locale X. Let P be the 
poset of pairs (U,s) where U € O(X) and s € F(U), partially 
ordered by 


(U,s) <(V,t) if U<V andt|U =s; 
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12. 


13. 


14. 


in other words, P is the category of elements of F; cf. §1.5. Define 
a covering system Cov on P as follows: for a family { (U;,s;) | 
i€JI} with (U;,8;) < (U,s) for alli 


{(Ui,si) |t€ 1} €Cov(U,s) if U=\/Uj. 


Check that this is indeed a covering system. Write E(F) for 
the locale of which (P, Cov) is a presentation. [So elements of 
O(E(S)) are closed sieves on P, as in Exercise 5.] Show that 
there is a canonical étale map mp: E(F) > X, with rp7!(U) = 
{(V,t) |V <U}. Also show that this is natural in F. 

With the notation of the preceding two exercises, prove that for 
any sheaf F' on X, there is a natural isomorphism between F 
and the sheaf S;,,) of sections of m7: E(F) — X. Also prove 
that for any étale map f: Y — X between locales, there is an 
isomorphism E(S;) = Y of locales over X. Conclude that the 
category of sheaves on X is equivalent to the full subcategory of 
(Locales)/X consisting of étale maps into X. 

Let f: F — E and g: G — F be geometric morphisms. Check 
that if f and g are open then so is their composite fog. Also 
show that if g is a surjection and f og is open, then f is open. 
For a cocomplete topos € and Loc€ as in §5(6), let 7: € — 
Sh(Loc(€)) be the unit of the adjunction of Proposition 5.3. 


(a) Show that for any E € € and any U € O(Loc(€)) = 
Sub¢(1), 
n«(E£)(U) = Home (E,U). 
(b) Conclude that the canonical map 
A: QD (Qe) 


is an isomorphism. (Here Q¢ is the subobject classifier of 
E and 2 that of Sh(Loc(€)); 2 is the transpose of the map 
r of §6(21).] 

Conclude that for each sheaf F' on Loc(€), the inverse 
image functor 7* induces an isomorphism 


— 
io) 
~— 


Sub(F’) —+ Sub(n*F) 


of subobject lattices. (Geometric morphisms with this 
property are sometimes called hyperconnected.) 

(d) Conclude that the geometric morphism 79: € — 
Sh(Loc(€)) is an open surjection. 


X 


Geometric Logic and 
Classifying Topoi 


A first-order formula ¢(21,...,2n) is called “geometric” if it is built 
up from atomic formulas by using conjunction, disjunction, and exis- 
tential quantification. Geometric logic is the logic of the implications 
between geometric formulas: 


Vz (d(x) > $(2)), (1) 


where the arrow here is for “implication” and ¢ and w are geometric. 
Many mathematical structures can be axiomatized by formulas of this 
form (1). For instance, local rings are axiomatized by the usual equations 
for a commutative ring with unit, together with the axiom 


Ve,yE R(x+y=1— 3az(e@-2=1) Vv az(y-z=1)) (2) 


which states that the ring is local; this axiom (2) is indeed of the form 


(1). 

Much as for the Mitchell-Bénabou language in Chapter VI, we will 
explain how geometric formulas can be interpreted in topoi: an inter- 
pretation M in a topos € is essentially a rule which assigns to each 
geometric formula ¢(x) an object in €, denoted by {z | 6(z)}™. An 
axiom of the form (1) is “true” for an interpretation M if { x | d(x) }“ 
is a subobject of {x | o(x) }™. 

For instance, if one writes down the axioms for local rings in the 
form (1) with ¢ and 7 geometric, then all these axioms are true for a 
given interpretation M in a topos € precisely when M defines a local 
ring-object in €. 

In Chapter VIII, we constructed a classifying topos for local rings. 
In this chapter, we will show that a classifying topos exists for any kind 
of structure which can be axiomatized by formulas of the form (1). This 
will be proved in §6. As an application of this existence of a classifying 
topos, we will show in Corollary 7.2 that Deligne’s theorem from §IX.11 
implies that a geometric formula is true in all topoi iff it is true in Sets. 
This means that suitable classical theorems automatically carry over 
from Sets to topoi. 
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1. First-Order Theories 


In general, we start with a fixed first-order language L, possibly one 
with several sorts. Such a language is given by a collection of “sorts” 
(or “types”) X,Y,..., collections of relation symbols R,S,... and of 
function symbols f,g,..., and possibly some constants c,d,.... The 
relation symbols may include properties (unary relations). Each relation 
symbol is given together with the sorts of its arguments. For instance, 
R = R(z,y) could be a binary relation taking an argument zx of sort 
X and an argument y of sort Y, in which case we write “R C X x Y”. 
(This is purely suggestive; surely R is not really a subset of some product 
X x Y, since R, X and Y are just symbols of the language.) Similarly, 
each function symbol f of the language is given with the sorts of its 
arguments, and the sort of its output. We write suggestively 


f: X%x...x Xn OY 


if f takes n arguments of sorts X;,...,Xn respectively to a value of 
sort Y. Also, each constant c of the language is given with a specified 
sort. We write “c € X” orc: 1 — X to indicate that the constant c 
is of sort X. We also assume for each sort X that the language has 
infinitely many variables x1, r2,x23,... (or 2,y,Z,...) of that sort and 
we sometimes write “x € X” to indicate that x is a variable of sort X. 

With such a language L one can build up terms and formulas in the 
usual way: 

Terms (of sort X): Each variable or constant of sort X is a term 
of sort X; if t),...,t, are terms of sorts X;,...,X, respectively and 
f: X, x...x Xn Y is a function symbol, then f(f1,...,¢n) is a term 
of sort Y. 

Atomic Formulas: If R C X, x -:- x X, is a relation symbol 
taking n arguments of sorts X,,..., Xn while t;,...,, are terms of sorts 
X;,...,Xn respectively, then R(ti,...,f,) is an atomic formula; also if 
t and t’ are terms of the same sort Y then t = ?¢’ is an atomic formula; 
finally, the symbols T and | are atomic formulas (the identically true 
and false formulas). 

From such atomic formulas, one can build up more complicated for- 
mulas using connectives A, V, =>, 7, and quantifiers for any sort X 
(V2 € X, dx € X). In the standard way, occurrences of variables gov- 
erned by quantifiers V, 3 are said to be bound; others are called free. 

An interpretation in Sets of such a first-order language L is a func- 
tion M which assigns to each sort X of L a set X“™); to each relation 
symbol R C X; X--: x Xn of L a subset R™) of the cartesian product 
Dou x x XL; to each function symbol f: X; x --- x X, — Y 
a function f™): Xi) xox KOO _, Y™); and to each constant c 
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of sort X an element c™) € X(™). In most texts on model theory an 
interpretation of L is usually called a (set-theoretic) L-structure. 

Given such an interpretation M, one can define for each term 
t(@1,...,2n) of sort Y, whose free variables are among the listed 2; 
of sort X;, a corresponding function 


t™ = 4! (ay,...,2,): XO tex KOO = yO), (1) 
and for each formula ¢(x1,...,0,) with free variables among the 2; of 


sort X; a corresponding subset 
{ (a1,..-50n) | @}™% Co XM xx KOM, (2) 


The definitions are given by induction on the construction of t (respec- 
tively of @) in the usual way, analogous to the treatment of the language 


of a topos in Chapter VI: For the case of a term t(z;,...,2n), one sets 

e if t = 2; (a variable) then the function t™(2,...,2n) is the 
projection me Kine X xO) > x™), 

e ift = c (aconstant of sort Y) then t™ (21,...,2n) is the composite 


of xi) x x XM) _.1 with @: 1 YO, 

e if t = f(ti,...,th) then t”(2,,...,an) is the composite of 
Gee ot): x) Met Ke es bee X see X 5 Ale and 
f™: yi) Xe x vy”? + y™), 


For the case of a formula ¢(21,...,2,) with free variables among 
those listed, one sets: 


e for ¢ an atomic formula of the form R(t;,...,tk), 
(a,,.--;@n) € {(a1,.-.,2n) | G}™ iff 
GY Gite) patty Gunes) eR: 
e for ¢ an atomic formula of the form t = t’, 
(@,,...,@n) € { (21,---;2n) | o}Y iff 
t! (Gy,.52300) = (aig cs sag) 
e for ¢ the formula T or _, 
{ (1,---,%n) | L}”% =@ and 
{(a1,.--;an) | TH! = XO x x XM, 
e for a conjunction ¢ A y, 


{(t1)---52n) [PAIN = 
{(21,---,¢n) | @}”¥ O{(a1,..-,2n) |b }™; 
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e the cases of disjunction ¢ V y, implication ¢ = w, and negation 
a@ are treated analogously, using the corresponding operations 
on the Boolean algebra of subsets of X a Kr XK x” 
quantification: for a formula of the form Vr € X $(x1,...,2n,2); 
(a1,--.,€n) € {(@1,...,;2n) | Ve © X O(a1,...,2n,2) }™ iff for 
all b € X, it holds that 


(a1,-..,@n,6) € { (@1,..-,2n, 2) | o}™; 


and similarly for any existential quantification such as 
da € X O(a1,...,2n,2). 


For each formula $(x,...,2%,) this defines by induction a subset 
{(21,--.;2n)| @}"C xe x +++ x XM), Under permutation of the 
variables, the subset changes in the evident way. It should be noticed 
that this subset does not depend on the list 2,,...,2,, of variables, once 
this list contains all the free variables occurring in ¢, in the following 
sense: if one adds an element x,,4; to this list, the identity 


{ (Signi) | p}™ x Baa = {(r1,...,2n41) | o}™) (3) 


holds. Notice however that although the right-hand subset in (3) is de- 
fined in terms of { (21,...,2n) | ¢} by (3), one may not be able to 
recover the latter subset from { (21,..-,2n41) | ¢}™ since the interpre- 
tations xe of the sort Xn4, may be empty. This observation also 
requires care in formulating “rules of inference” for such a language. 

The formula ¢ is said to be valid in the interpretation M in Sets if 
for any sequence 1},...,£p of variables (x; of sort X;) such that all the 
free variables of ¢ are contained in this sequence, one has 


{ (e1s--+1@n) |O}4 = Xp x --- x XEN. (4) 


[Notice that if @ is a closed formula (i.e., @ does not contain any free 
variables), then ¢ is interpreted as a subset of the empty product 1.] 

A theory T in the language L is just a set of formulas, then called 
the axioms of T. A model of T is an interpretation M of L in which all 
axioms of T are valid. 

For example, the theory of abelian groups can be formulated in the 
language L containing one sort (X say), no relation symbols, two func- 
tion symbols +: X x X — X and —: X — X, and one constant 0 € X. 
An interpretation M of this language is given by a set S = X™) 
equipped with two functions +(): § x S$ > S and (-)): § = S, 
and one specified element 0) € S$. The theory T of abelian groups is 
the collection of axioms for abelian groups: (s+y)+z2=ae+(y+2), 
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er+y=y+a,2+0=2, 2+ (—2z) = 0. Such an interpretation S is a 
model of the theory of abelian groups exactly when the operations (a) 
and (~—)(“) define a group structure on the set S, with neutral element 
OoM ES. 

A homomorphism H: M — M' between two interpretations of a 
language L in Sets is given by functions 


Hy: XO) x), (5) 
one for each sort X, such that the interpretation of relation symbols, 
function symbols, and constants is respected. Thus for each relation 


symbol R C X, x--- x X, in the language L, the map Hx, x--- x Hx, 
must send R™) into R™"), as on the left in 


[Bast (6) 
RO ee KO i OO), 


Moreover, for each function symbol f: X; x --- x X, — Y and each 
constant c € X, the following diagrams should commute: 


XO) xx XO) Lyon 1 oe _, x (M) 
Hat | \" | be (7) 
peas Kc Ves Ve) een (Md 
foe) (MM) 


For a theory T in a language L, a homomorphism H: M — M' of 
T-models is a homomorphism of L-interpretations, where M and M’ are 
both models of the theory T. This defines a category of T-models and 
homomorphisms between them. 

For example, for the theory T of abelian groups, a homomorphism 
of T-models is simply a group homomorphism. 


2. Models in Topoi 


For a first-order language LD as in the previous section, there is an 
evident extension of the notion of an interpretation of L in Sets to that 
of an interpretation in any given topos €. Such an interpretation M in 
E is given by an object X“) of € for each sort X of L, a subobject 
ROY) xe x +++ x X™ for each relation symbol R CX, x---x Xy 
of L, an arrow f(™): 5 ei xe x KM) _, YO in € for each function 
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symbol f: X, x-+- x X, > Y of L, and an arrow c™): 1 = X™) (a 
global section) for each constant c of sort X in L. 

Given such an interpretation M of LD in a topos €, one can define 
for each term t(z1,...,2n) of sort Y, with free variables among the x; 
of sort X;, an arrow 


Meg” xe x XMM) __, y(™) (1) 


exactly as in the case of sets treated in the previous section. Next, one 
defines for each formula ¢(21,...,£,) with free variables among the z; 
(of sort X;) a subobject in €: 


{(a1,.-,2n) | 6}! © XP x. x XM, (2) 


by induction on ¢, again much as in the case of sets, and similar to 
the treatment of the Mitchell-Bénabou language in Chapter VI. If ¢ 
is atomic, say ¢ is the formula t(7,...,2,) = t'(a1,...,2n), then the 
subobject { (a1,..-,¢n) | t = t }™ is the equalizer of the arrows t(™) 
and t/™) in €: 


{(@1y.--52n) [t= t pM XO) x x XO) — yO, (3) 


If ¢ is R(ti,...,t) for a relation symbol R and terms ¢; of sort Y, 
(each with free variables among 27,...,2, of sorts X),...,Xn), then 
{ (21,-.-,2n) | R(tr,-.-,t~) }¥ is the pullback of the given subobject 
R™) along CE sagt \: 


{ (t1y.++52n) | Rts... 4t,) #4! —————> R™) 


| |e 


xe) x Be Sit x XM) es a x Pe x ye, 
(ty ste ) 
Finally, { (21,...,2n) | L}@9 and {(21,...,2n) | T}@” are the top 
and bottom elements of the Heyting algebra of all the subobjects of 
x™) K+ X x”™) 
1 nos 
The connectives A, V, =>, — are interpreted using the corresponding 
operations of that Heyting algebra; for instance, for A one defines 


{ (a1,.--)tn) | 6A Y}™ 
= {(r1,...,¢n) | ¢}¥ A{(a1,...,¢n) | v}™. (5) 


Finally, as for the quantifiers, recall that for any arrow a: E’ > E in 
a topos €, the “inverse image” a~!: Sub(EZ) > Sub(E’) has left and 
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right adjoints 4, and V,: Sub(£’) — Sub(£). Thus we can interpret 
the quantifiers of the language L by these adjoints: 


{(21,..-,2n) | V2 € X b(a1,...,2n, 2) }” 
= Va({(215-.-; fn, 2) | O(@1,-.-,2n52) Pe), (6) 


where 7: x) xe x XO) y XM _, xe) xe x XM) is the 
projection; and similarly for the existential quantification. 

This process defines for each formula ¢(21,...,¢n) a subobject 
{(21,...,2n) | @}” of the product x x +» x XM), One eas- 
ily checks that it is again independent of the choice of the sequence 
£1,+++;£n which contains all the free variables in ¢, as expressed in one 
case by the identity (3) of §1. As before, a formula ¢ is said to be 
valid in the interpretation M (in the topos €) if {(a,...,¢n) | 6 }“ 
is the maximal subobject X ws) Kee xe) itself, for every sequence 
Z1,+-.,2n containing the free variables of ¢. It is enough to test this for 
just a minimal such sequence. For a theory T in a language L, a model 
of T (or T-model) in a topos € is an interpretation M of this language 
in € such that all axioms of T are valid in M. For example, a model 
of the theory of abelian groups (see §1) in a topos € is nothing but an 
abelian group object in €. 

As in the case of interpretations in Sets discussed in the previous sec- 
tion, there is an evident notion of homomorphism H: M — M' between 
two interpretations of a language I in a topos €; such a homomorphism 
is given by arrows Hx: X(™) | xX in E, one for each sort X, re- 
specting the interpretation of relation and function symbols as well as 
that of constants. [This is expressed by commutative diagrams in €, of 
the same form as the diagrams (6) and (7) of §1.] This definition yields 
a category of all the interpretations of L in €. Just as in the case of 
Sets, each theory T in the language L gives rise to a full subcategory of 
this category of interpretations: the category 


Mod(T, €) (7) 
of T-models in €. 
Notice that if F: € — F is any left-exact functor, then each interpre- 


tation M of the language L in € can be transported to an interpretation 
F(M) in F, defined on sorts by 


XF(™) — F(X™)), (for each sort X of L) (8) 


as follows. If R C X, x--- x Xy is a relation symbol of L, its interpreta- 
tion in € is a subobject R) C Xi) x... x XM. Since F preserves 
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products and monomorphisms, one can thus define the interpretation 
R(™)) of R in the topos ¥ simply by applying F to R™): 


RF) = F(R™)) C 5 ere we, ohare 


The same procedure works for function symbols f and constants 
c; thus for a function symbol f, its interpretation f*(”) in F is the 
unique horizontal arrow below such that the square 


(M) 
F(xX™) xe x XM) A), Fy) 


F(X!) x. x F(X) 


XP yo. x KF) rm yr) 


PMY [8 
Foc) 
X((™)) in F is defined via (8) as the composition 1 & F(1) ———> 


F(X) = xP), 

In this way, any left-exact functor F': € — F gives rise to a functor 
from the category of L-interpretations in € to that of L-interpretations 
in ¥. But for a theory T in the language L, this functor does not in 
general restrict to a functor 


commutes; and for a constant c € X, its interpretation c 


F: Mod(T, €) > Mod(T, F); 


there is no reason why the validity of the axioms of T in the L-structure 
M in € should imply their validity in the induced L-structure F(M) in 
F. In the next section, we will describe an important class of theories 
T with the property that, for any geometric morphism f: F —- €, the 
inverse image functor f*: € + F does send T-models in € to T-models 
in F. 


3. Geometric Theories 


In this section L is a fixed first-order language. If f: F — Eisa 
geometric morphism, then (as at the end of the previous section) the 
inverse image functor f* yields for every interpretation M of L in the 
topos € an interpretation f*M of L in the topos ¥. For a formula 
$(21,.--,2n) Of the language L with free variables x; of sorts X;, there 
is thus a subobject in € 
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by applying the inverse image functor f* to this abi and Pri 
at ) with F = f*, we obtain a subobject of XU") x... x x("™ 


FP({ (wry sm) GYM) S F(R x + x XL) 
P(X) xx POM) 2) 
= XP) x. XM), 
On the other hand, one can also use the interpretation f*M in F to 
obtain a possibly different subobject in F: 
{(e1,---s20) |@ PO! CXL) x x KYM, (3) 


Theorem 1. Let f: F — € be a geometric morphism and let M be 
an interpretation of L in €, with induced interpretation f*M in F. If f 
is open, then for any formula ¢(21,...,2n) as above, 


FA Gr,.-5tn) OP) ={ (r,s) Oh ™, (4) 
[This is an equality of subobjects of XL ™)) Kee xX xy ae isomorphic 
«py (M py (M 

by §2(8) to ft(X{") x ++ x f(Xn),] 
Proof: The proof is by induction on the construction of the formula 
@. First, consider a term t(21,...,2n) of sort Y with free variables 
among the listed x; of sort X;. Its interpretation in M is an arrow 
1); xi") x-++x XM) _, yD in the topos €. Using the fact that f* 
preserves products, an easy induction on the term ¢ (that is, an induction 


on the construction of the term ¢ from variables, constants, and function 
symbols) now shows that the square 


* 7(M) 
foe ere x X(M)) FEO), pay (Y (M)) 


| | 


x M) xe x XM) _ pF") 
i n pF") 


commutes. Since f* preserves pullbacks and equalizers, it then follows 
immediately from the definition that for atomic formulas R(t1,...,tx) 
or t= t’ (with free variables among 2,...,2), the identities 


f*({ (e1,-.-5a@n) | R(tr,.--, te) }) = {(an,.--,2n) | R(ti,---, te) J! 
F*({ (21,---,2n) |t=t'}) = {(z1,---,2n) [e=ey me 

both hold. Also, since f* preserves the smallest and largest elements of 

the subobject lattice Sub(x(™) ite KM dy, we have 


f*({ (@1,+-+52n) | ih (Giga tn | Lyre), 
FC ay -6y Bn) | TIM) = (ry. tn | TPO. 
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Thus, the theorem holds for all atomic formulas ¢. We now proceed 
by induction on ¢, according to the definition of the class of formulas. 
For example, if the theorem holds for formulas ¢ and w, then it also 
holds for their conjunction. This follows because f* preserves meets of 
subobjects; more precisely, for any two subobjects A and B of any object 
E of E, one has 

F*(ANB) = f*(A) A f*(B); 
as subobjects of f*(E), because f* is left exact. Similarly, if the theorem 
is true for formulas ¢ and y, then it is also true for their disjunction 
¢@Vw. This follows similarly since f* preserves suprema of subobjects 
of any object E: 

FAV B) = f(A) V f"(B). 


Indeed, AV B C E is defined as the image of A+ B — E, and f* 
preserves coproducts as well as images. Furthermore, if the theorem 
holds for a formula ¢, then it also holds for the formula dz € X ¢. In 
fact f* preserves existential quantification along any map; i.e., for any 
arrow a: & — E’ in E and any subobject A C E, 


F*(AaA) = Ay+(ay f*(A) 


[cf. also [X.6(14)]. Indeed, by definition, 3,A is the image of the com- 
posite A — FE — E’, and f* preserves images. 

So far, we have not used the assumption that the geometric mor- 
phism f: F — €E is open. This assumption is needed for the other 
inductive clauses, concerning implication ¢ > y, negation 7@, and uni- 
versal quantification. Indeed, these follow in a similar way as in the 
cases of disjunction, conjunction, and existential quantification already 
treated from the following lemma, which thus completes the proof of 
Theorem 1. 


Lemma 2. For an open geometric morphism f: F — €, and for 
any arrow a: E — E’ in € and any subobjects A and B of E: 


(i) f*(VaA) = Vpe(ay (fA); 
(ii) f*(A => B) = f*(A) = f*(B); 
(iti) f*(4A) = >f*(A). 


Proof: (i) holds by Theorem [X.6.3 and (ii) follows from (i), since 
the implication A = B can be expressed in terms of universal quantifi- 
cation as 

(A => B)=V.(AA B), 


where a: A> E is the inclusion of the subobject A of E, and AA B is 
viewed as a subobject of A. Indeed, for any other subobject C of E, one 
has 

C<SVa(ANB) iffa7l(C)< ANB 
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by the adjunction a7! 4 V,. But a!(C) = AAC and AAC < AAB 
if ANC < Biff C < (A => B). Thus, for any subobject C we have 
C < Va(A A B) iff C < (A = B), and hence Va(AA B) = A => B, 
as claimed. Finally, (iii) follows from (ii) since A = (A => 0) and f* 
preserves the initial object 0. 


Remark 3. The converse of Theorem 1 also holds: if f: F = Eisa 
geometric morphism with the property that, for any first-order language 
L, any interpretation M of L in € and any formula ¢(2,...,¢,) with 
free variables among those listed, the identity (4) holds, then f must be 
open, at least when E is cocomplete. This follows from Theorem IX.6.3 
(open geometric morphisms preserve universal quantification). For con- 
sider a language with just two sorts X and Y, one function symbol 
g: X — Y and one unary relation symbol R C X. Then any dia- 


gram in € of the form A C EF’ + Eis an interpretation in E of this 
special language, and V.(A) = {y | ¢(y) }” where ¢(y) is the formula 
Va (g(x) = y > R(x)). The identity (4) for this formula ¢(y) then yields 
that f*(VaA) = Vy-(a)(f*A), so that Theorem IX.6.3 applies. 


By Theorem 1 a sentence ¢ which is valid in an interpretation M in 
the topos € remains valid in the induced interpretation f*(M) in F. In 
particular, if M is a model of a theory T in the topos €, then f*(M) 
will be a model of T in F¥. Thus, 


Corollary 4. For any theory T in the language L, any open ge- 
ometric morphism f: F — E€ induces a functor between categories of 
T-models, 


f*: Mod(T, €) > Mod(T, F). 


Theorem 1 is not true for an arbitrary geometric morphism f: F > 
€; we will shortly give an example in (13) below. It will be useful, 
however, to select a class of formulas ¢, the so-called geometric formulas, 
such that equation (4) holds for any geometric morphism f, open or not. 

A formula ¢ of the first-order language L is said to be geometric if 
it can be obtained from atomic formulas by conjunction A, disjunction 
V, and existential quantification dz € X. More precisely, the collection 
of geometric formulas is the smallest collection of formulas such that 


(a) the atomic formulas R(t;,...,t,), t= t’, T, and 1 are all geo- 
metric formulas; 

(b) if d and y are geometric formulas, then so are ¢ V p and ¢ A y; 

(c) if d(a1,...,2n) is a geometric formula, then so is the formula 
dae € X o(21,..-,%n), where X is any sort and z is a variable of 
that sort. 
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Theorem 5. Let f: F — € be any geometric morphism, let M be 
an interpretation of the language L in €, and let f*M be the induced 
interpretation in ¥. Then for any geometric formula $(21,...,2n), 


PRE (@as-++s tm) 1H) = (1-1 ttn) | GPM (5) 


[where equality is that as subobjects of xf AY ser ag geet MY es 
F(X x oe XPM. 
Proof: The proof is by induction on the construction of the formula 


@, and is identical to the first part of the proof of Theorem 1 (which did 
not use the assumption that f is open). 


A theory T in the language L is said to be a geometric theory if all 
its axioms are of the form 


Vary .».Win (O(21,+--,2n) > U(21,---,2n)), (6) 
where ¢@ and y are geometric formulas, with free variables among the 
listed 21,...,2n- 


Corollary 6. For a geometric theory T, each geometric morphism 
f: F — € induces a functor f*: Mod(T,€) — Mod(T, F). 


Proof: Suppose that M is an L-structure in the topos € such that 
all axioms of T are valid in M. We have to show that the axioms of T 
are again valid in the induced structure f*M in ¥. Each such axiom is 
of the form (6), and such an axiom is valid in € iff 


{ (x1,.-+,0n) |G }™ < {(a1,.-., an) | y}M (7) 


as subobjects of X, x --- x X, (where X; is the sort of the variable x;). 
Since f* preserves inclusions between subobjects, (7) gives 


FL (e1,---5tn) | GIM) < PL (er, -- sn) | Y}™). (8) 


And since ¢ and y are geometric, Theorem 5 now yields 


{(@ry---53m) |G < { (ary... t0) PHM, (9) 


Then V2; ...Vin(¢ => w) is valid in f*M. This applies to each axiom 
of T, so f*M is a model of T in F whenever M is one in €. 

For example, consider the theory T of local rings. This theory can 
be formulated in a language L with one sort X, two function symbols + 
and -: X x X — X, and two constants 0 and 1. The axioms of T are, 
first of all, the usual ring axioms such as 


Va,y,zEX(a-(yt+z)=r-y+z-z), 
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etc., etc. These are all of the form: a stack of universal quantifiers, 
followed by an equation between two terms, i.e., axioms of the form 


Va, €X...VEn EX (t(a1,---, fn) = U(21,---52n)); (10) 


for suitable terms ¢ and t’. One can also write this axiom in an equivalent 
way as 


Vay €X...VEn EXT > t(ey, «015 2n) =F (iiarotn)): (11) 


This is clearly a geometric formula. Furthermore, as in VIII.6.1, there 
is the following additional axiom for local rings: 


Va € X (ay € X (x@-y=1) Vay € X ((1~2)-y=1)). 


This formula is equivalent to Ve € X (T => Aye X (x@-y=1)VAye 
X ((1~ £)-y =1)), which is geometric. So the theory T of local rings 
is a geometric theory. The category of T-models in a topos € is thus 
precisely the category LocRing(€) of local rings in €, as introduced in 
§VIII.6. Thus Corollary 6 above applied to this theory T implies for 
any geometric morphism f: F — € that its inverse image f* sends local 
rings R in € to local rings f*R in Ff, and this purely on the basis of the 
syntactic form of the axioms for local rings. 

As remarked, any “equational axiom” of the form (10) is equivalent 
to a geometric axiom (11). So any theory axiomatized by equations 
is a geometric theory. This includes the usual “algebraic” theories of 
monoids, commutative monoids, (abelian) groups, R-modules for a fixed 
ring R, chain complexes (use infinitely many sorts here), etc., etc. 

In Chapter VIII we also considered orders. These can be described 
in a language with one sort J, one relation symbol <, and two constants 
b and t, by the following geometric axioms [see (i)—(vi) of §VIII.8): 


VeelI(T >2a< 2), 
Va,yze€l(a@<yAy<z>2<2), 
Va,yel(rcyAys<r>r=y), 

Ve el(T > b<2rAzXt), 
(o=t) >, 
Va,yel(Ts>a<yVy<z). 


Thus as a special case of Corollary 6 we obtain Lemma 2 of §VIIL8, 

again purely on the basis of syntactic form of these axioms for orders. 
As a final example, consider the theory of fields; this theory can be 

formulated in the language used for commutative rings with unit, just as 
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the theory of local rings considered above. After the purely equational 
ring axioms, there are several possible ways to express the additional 
field axiom; for example, either of 


Va (xz =OV Ay (zy = 1)), (12) 
Var (aay (ry = 1) > 2 = 0). (13) 


Of course, for a ring in Sets, these two axioms are equivalent and are 
true precisely when that ring is a field. Notice, however, that the first 
axiom (12) is of the form (6) required for geometric theories, at least 
when we rewrite it as the equivalent axiom 


Va(T = 2 =0V ay (zy = 1)); (14) 


but the second one (13) is not. For ring objects in topoi, axioms (12) 
and (13) are not equivalent. For example, consider the topos Sh(X) of 
sheaves on a Hausdorff space X, and in it the sheaf Rx of germs of 
real-valued continuous functions on X. This sheaf Rx is a ring-object 
in the topos Sh(X). Any point x € X gives a point of the topos Sh(X), 
ie., a geometric morphism z: Sets — Sh(X), and the inverse image 
x*(Rx), which is the stalk of Rx at «x, is the ring of germs of real- 
valued continuous functions at the point xz. In general, this is a local 
ring but not a field. So (14) is not true for z*(Rx) in general. By 
Corollary 6 it cannot be true for the ring Rx in the topos Sh(X) [so 
neither is the equivalent axiom (12)]. 

On the other hand, axiom (13) does hold for the ring Rx in Sh(X). 
Indeed, the subsheaf {x | Sy(zy = 1) }‘*x) of Rx is the sheaf of con- 
tinuous functions into R — {0}, so A = {x | aay (xy = 1)}"**) is the 
subsheaf of Rx given for each open set U of X and each continuous 
a: UR, by 


a €A(U) iff for no open subset V # @ of U does it hold that 
a(x) ~ 0 for each point x €V 
iff a} (0) is dense in U. 

Since X is assumed to be Hausdorff, it follows that a € A(U) iff the 
function a is identically zero on U. Thus A = {z | 2 = 0}**, hence 
(13) holds for Rx. 

Thus, the theory of fields in topoi can be axiomatized in various 
nonequivalent ways, one as a geometric theory [as in (12) or (14)], but 
others not [e.g., by axiom (13)]. 


4. Categories of Definable Objects 


By a “definable” object we wish to mean one of the form { x | ¢(x) }” 
for some geometric ¢. As a preliminary to the introduction of such 
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definable objects, we observe that the familiar correspondence between 
a function y = f(x) and its graph (in the plane) can be carried out in 
any topos €—or, for that matter, in any category € with finite limits. 
Indeed, in the cartesian x-y plane the graph of f: X — Y is the subset 
of X x Y consisting of all the pairs (x, fr) for  € X. Similarly, given 
an arrow s: A — B in a topos € its graph is defined to be the subobject 
of the product A x B represented by the mono (1,8): A — A x B. 
More generally, eh factorization of the arrow s through an isomorphism 


a: S$ Aas A—~ S —»+ B shows that the graph of s is represented 
by the mono m = (a, 8’): SA B, as in the figure 


Bets 
AX Graph(s): S Be AxB. (1) 
A 


Indeed, (1, s) is clearly isomorphic to (a, s’), as subobjects of A x B. 

Under this correspondence between arrows s: A — B and graphs 
S > Ax B, the composition of arrows can be expressed as a pullback of 
graphs. More explicitly, given a graph S of s and a graph T of t: B= C, 
we can combine the diagram (1) for s with a corresponding diagram for 
t, with an iso £, to get 


C7 2 og xt 


ST 


Here the upper right-hand square is a pullback over B. But the map 
B is iso. Hence, the pullback @’ of @ is iso and therefore the pullback 
S x pT, projected by the edges of (2) to A x C, is by (1) above the graph 
of the composite tf 0 s. 

Now change notation and let T be a fixed geometric theory. Our 
aim is to associate with any model M of T in any topos € a category 
Def(M) consisting of those objects and arrows in € which are definable 
by geometric formulas. We will write 


X= Xj,...,Xn 


for a finite list of sorts of the language. For each such list we have an 
object X™) of E, 
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as part of the data which specify the model M. An object of Def(M) 
is to be a pair (A,X), where X is such a list of sorts, while AX) 
is a subobject in € (that is, an equivalence class of monomorphisms) 


for which there is a geometric formula ¢(2,...,2n), with free variables 
among the 2,...,2n, such that 
A={x|¢(x)}", (3) 


as subobjects of X). Here, as usual, xz stands for the sequence 
(1,---,%n) of variables. 

If for a given subobject A— X™) such a geometric formula @ exists 
such that (3) holds, we say briefly that A is a definable subobject of X™). 
Notice that there may be many different formulas each of which witnesses 
that a given subobject A~ X™) is definable: As a trivial example, if 
A is definable by the formula ¢(x) as in (3), then it is also definable 
by the formula ¢(2’) = ¢(2/,...,x/,) obtained from ¢ by replacing the 
variables 21,...,2n by others r{,...,z/, of the same sorts, since, clearly, 


A={x| d(x) }¥ ={2' | d(a')}™. (4) 


In such a case, one sometimes says that ¢(x’) is an alphabetic variant of 
$(2), 

Next we use graphs to introduce definable arrows between two such 
definable objects (A,X) and (B,Y), where X is as above and Y = 
Y,,.-., Ym is another list of m sorts of the language. A definable arrow 


(s, X,Y): (A,X) — (B,Y) (5) 


[or briefly, s: (A, X) — (B,Y)] is to bean arrow s: A > Bin the topos € 
such that its graph S C Ax B, viewed as asubobject of X°)xY¥™) isa 
definable such subobject. In other words, there must be some geometric 
formula o(2, y) = o(21,..-,2n,Y1>+++; Ym) such that 


S= {(z, y) | o(2,y) y™ (6) 


is an equality between subobjects of X() x YM), 

With these objects (A, X) and these arrows s = (s, X,Y): (A,X) > 
(B,Y), one obtains a well-defined category Def(M), for which the 
identity-arrows and composition of arrows are given by identities and 
composition in the topos €, 

More explicitly, if (A, X) is an object of Def(M), so that A= {x | 
¢(x) }™ as in (3), then the identity arrow 1: A > A in € yields an arrow 


1 = (1, X, X): (A,X) + (A, X) 
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in Def(M), because its graph—the diagonal A, C A x A—is definable 
when viewed as a subobject of X(™) x X™), as in 


Aa = {(2,2') | 6(2) Ad(2') Nay =a, A---Atn =a, }%, (7) 


where x},...,2/, are new variables of the same sorts as 11,...,2n, Te 
spectively. To see that (7) holds, notice first that by §2(5) the right-hand 
side of (7) is 


{(x,2')| (a) } A{(a,2") | 6(2")} A{ (z,2") | 2 = 2"}, 
which by §2(3) is (A x X) A(X x A) A Ax = Ag, as desired for (7). 
Composition of arrows in the category Def(M) is constructed in a 
similar way, using the composition in the topos €. Explicitly, for objects 
(A, X), (B, Y), and (C, Z) of Def(M) given by geometric formulas ¢(x), 
wy), and y(z) respectively, the composite of two definable arrows 


(s, X,Y): (A,X) (B,Y) and (t,Y,Z): (B,Y) — (C,Z) 


is the arrow 
(tos, X,Z): (A,X) > (C,Z) 


given by the composite in € of s: A— B and t: B > C, which is again 
definable: if the formula o(z,y) defines (the graph of) s while 7(y, z) 
defines t, then the formula 


Ay (o(z,y) A7(y,2)), (8) 


which is again geometric if o and 7 are, defines the graph of tos as a 
subobject of X) x 7), To see this in detail, write 


S={(a,y) | o(z,y)}¥ CAx BCX xy™) 
for the graph of s and similarly 
T = {(y,2) | (yz) }™ 
for that of t. Now introduce the definable subobject 
R= {(zx,y,2) | o(x,y) Ar(y, 2)” (9) 
of Ax Bx CG XM) x YW) x Z7(™), By §2(5) , this R is 


R= {(2,y,2) | o(z,y) }¥ A {(e,y,2) | r(y,2) }™ 
= (8 x Z)) a (X xT), 
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as in the pullback 


Ro ee MY eT 


| ow 


5 eZ KO ey 20, 


This pullback R can also be constructed in the stages displayed by com- 
bining the descriptions of S' and T in the diagram 


 ————— ey | nee CO tera i 


a | 


Sx C———— A x Bx C+ X™) x Bx C (11) 


| | | 


Sx Z™),___, Ax Bx 7), KX) xy) x 7(™) 


where the upper left vertex R’ is defined as a pullback. Since the other 
three small squares are trivially pullbacks, the large outer square in 
(11) must also be a pullback. Thus, the pullback R’ in (11) must be 
isomorphic to the pullback R in (10), so R also fits into the pullback 


Ro Ax T 


jo (12) 


Sx Cr 7 AX BxC 


from the upper left square of (11). Here m and n are the given inclusions 
SCAxBandTCBxC. But now an easy “erasing identities” lemma 
shows that, for any diagram as on the left below, this diagram is a 
pullback iff the diagram on the right (obtained by erasing the identity 1 
in f x 1) is a pullback: 


P——+»UxM P——U 
ie Pb 
V WxM, V ———> W. 


Applying this twice to (12) presents R finally as the pullback in the 
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square 


A+ ——— SS Ax Be, 


Therefore, by the construction §2(6) for the existential quantifier as an 
image, as in Proposition IV.6.3, the subobject 


R" = { (a, z) | ay (o(a, y) Ar(y,2))}4% CAxC (15) 


is the image of R C Ax BxC under the projection Ax Bx C3 AxC 
indicated in (14), as in the factorization 


R»-————- Ax BxC 


| | 


R""——— Ax C. 


But by the construction (2) of the graph of a composite above, the 
pullback R = S xg T of (14) is exactly the graph of the composite t 0 s; 
in particular, R — Ax C is already monic, so that R = R”. This shows 
that the formula (8) above does indeed define the graph R” in (15) of the 
composite. So the composition of two definable arrows is again definable 
in our sense, and thus gives the composition (evidently associative) in 
the category Def(M). 

To summarize, for a model M of a geometric theory T in € we have 
constructed a category Def(M) whose objects are given by a list of sorts 
X = X,...,X,, and a definable subobject A CX), and whose arrows 
are the definable arrows of € between such subobjects; identity arrows 
and composition of arrows in Def(M) are defined just as identities and 
composition in €, since we have shown that identities are definable, and 
that compositions of definable arrows are definable. 

We emphasize that for each object (A,X) of Def(M) the list. of 
sorts X),...,Xpn must be explicitly given; while a formula of the lan- 
guage defining the subobject A must exist but is not uniquely given. In 
particular, the category Def(M) consists of certain pairs (A,X) where 
A © X™) is a subobject. It is not strictly a “subcategory” of the 
topos €, although we will intuitively think of Def(M) as if it were the 
“subcategory” of € consisting of definable objects and arrows. 
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One may construct a suitable “forgetful” functor 

Def(M) - €, (16) 
defined on objects as follows. An object (A,X) of Def(M) is given by 
a list X of sorts and a subobject of the corresponding X (M)__that is, by 
an equivalence class of monomorphisms A»—> X (M)_ Choose a particular 
monomorphism in each such equivalence class and then take the chosen 
object A as the value (A, X)) under the functor (16). This “forgetful” 
functor is clearly faithful. We will now prove that the category Def(/) 
has finite limits, and that the forgetful functor (16) preserves these. 


Lemma 1. For any model M of a geometric theory T in a topos 
€, the category Def(M) of definable objects and arrows has a terminal 
object, preserved by the forgetful functor (16). 


Proof: Take X = X,,...,Xn to be the empty sequence of sorts, for 
n = 0, so that X™) is the empty product; i.e., the terminal object 1 
of the topos €. Then for this sequence X, the pair (1, X) is an object 
of Def(M); indeed, the identically true formula T is a formula without 
free variables, and {- | T }™” is the terminal object 1 itself, so 1 is a 
definable subobject of itself. Furthermore, if (B,Y) is any other object 
of Def(M), where B— Y™) is a definable subobject, then the unique 
arrow B — 1 in € has a definable graph (this graph is the subobject B 
itself again). It follows that for the empty sequence X, the pair (1, X) 
is a terminal object of Def(M). 


Lemma 2. For any model M of a geometric theory T, the category 
Def(M) has pullbacks, and these pullbacks are preserved by the forgetful 
functor Def(M) — E of (16). 


Proof: The proof is long, but is really a straightforward matter of 
constructing the appropriate diagrams for the evident formulas. 
Consider pullbacks for a diagram of the form 


(A, X) —*— (C, Z)-+— (B,Y) (17) 
in the category Def(M), where X = Xj,...Xpn, Y = Y,...,¥m, and 
Z = Z,...,Z¢ are sequences of sorts as before, while the monomor- 


phisms 
:ASGX™M)) 5; BOY, kon 7 
represent subobjects which are definable by geometric formulas, say 


o(z), wy), and y(z). Moreover, in (17), s = (s,X,Z) and t = (£,Y, Z) 
are arrows in Def(M), so that the graphs of s: A— C andt: BC, 


Ar—@9)_, A x CRE XM) yx gM) 
(18) 


Br. B x C22 VM) y gi) 
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are definable, say by geometric formulas o(z,z) and r(y,z). We claim 
that the pullback of (17) in Def(M) can be constructed as 


(A xc B,(X,Y)) —* + (B,Y) 


| | (19) 


(A, X) ——,—— (C, Z), 


& 


where (X,Y) = (Xj,...,Xn,¥1,..-, Ym) is the concatenated list of sorts 
and A xc B is the pullback of s: A > C and t: B > C in € and is to 
be regarded as a subobject of XW”) x Y("), as in 


Axe B———> Ax B44 XM) x YO), (20) 


This result, once proved, shows that the forgetful functor Def(M) — E 
preserves pullbacks. 

To begin with, we prove that the square (19) is indeed a square in the 
category Def(M); in other words, that the subobject A xc B and the 
projections are definable by geometric formulas. Consider the (evidently 
definable) subobject R of X() x Y) given by 


R= {(z,y) | dz (o(a, 2) Ar(y,2)) }™. (21) 
We will prove the equality 
R=AxcB (22) 


of subobjects of XM) x YM), First, since o(z,z) defines the graph 
of s as in (18), it follows that {(z,y,z) | o(z,z)}™” is the subobject 
Ax YM), xX) x 7) x YO) & xX) x yO) x 7™) that is, 


(ix 1,ksm): Ax YO (XM x VOD) x ZM), (23) 
And similarly, { (x, y, z) | 7(y, z) }™ is the subobject 
(1 x j, ktma): X x Bol x) x yO) x 7), (24) 
Thus, by the rules of §2, the subobject {(z, y,z) | o(z,z) A r(y,z) }“ 
is the meet P of the two subobjects (23) and (24) as displayed in the 
pullback (25) below: 
Pe YUN 
wa lc xj, kta) (25) 


Ax Y™) —___, x) x y™) x 7M), 
(¢x1,ks71) 
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[In this diagram, the arrows (p, v) and (u,q) denote the projections from 
the pullback P.] Thus the projection of P on the first two factors is 


(p,q): Po XPD xy, 


Also the subobject R defined in (21) by the existential quantifier 4z is— 
according to the interpretation of such quantifiers—exactly the image of 
this map (p, q). 

We now “erase the identities” as in (13) above, erasing first the 
identity on X() and then the one on Y™) in (25) to find that the P, 
A, B, Z™)-square on the right below is a pullback: 


xX(™) y y(M)__ ey) 
SS | 
m ae (26) 
rn 
u C kt 
a" ~~N . 


Since k: C — Z™) is monic, it follows that the inner P, A, B, C- 
square (distorted) in (26) is also a pullback; in other words, there is an 
isomorphism 


(u,v): P—+AxcB. (27) 


But by the commutativity of (25), p = tu and q = jv so that 


and by (27) this is monic. 

Now by (21) the subobject R is the image of (p,q): P 3 X™) x 
yY(™). this shows first that P equals R as subobjects of X™) x YY), and 
second that P = R is identical to A xc B as subobjects of XO) x Y¥), 
This proves the desired equality (22). 

To finish the proof that (19) is a square in the category Def(M), 
we must show first of all that the projections 7;: A xo B — A and 
Tm: Axo B — B in € yield arrows in Def(M); that is, that the graph 
of ™, 


Axe BH (A xe B) x APD, (XO x YOD) x XCD (28) 


548 X. Geometric Logic and Classifying Topoi 


is a definable subobject, and similarly for 72. For the case of 71, we 
see first that the following diagram is a pullback since i: A ~ X™) igs 
monic: 


Ake —— Oe ae By A 


va] |oors (29) 
X(™) x Yaa CO sc VY 5 EMD 
171 


Therefore, the composite subobject (28) is the meet of the bottom and 
the right-hand subobjects of (X() x Y™)) x X™) in (29). Each of 
these is definable, respectively, as 


{(z,y,2')|e=2"}, 
{(x,y, 2’) | dz (o(a, z) A r(y,z)) A d(2’) }, 


[the latter by (21) and (22)], so that the composite subobject (28) is 
definable by the conjunction of these formulas. This shows that the first 
projection 7, in (19) is a definable arrow. The case of 72 is, of course, 
identical. Thus (19) is a square in Def(M). 

To complete the proof of the lemma, we finally show that (19) has 
the required universal property for a pullback in Def(M). To this end, 
consider another object (Z,W) of Def(M), where W = Wi,...,War 
is a list of sorts, while 2: E—W™) is a definable subobject, and let 
f: (E,W) — (A,X) and g: (E,W) — (B,Y) be arrows in Def(M) 
such that sf = tg there. Then in particular sf = tg: E — C as arrows 
in €, so that the universal property of the pullback in € gives a unique 
arrow ( f,g) as in 


| on 6: 


The point is to show that ( f,g) as constructed here represents an arrow 
(E,W) — (A xc B, (X,Y)) in Def(M); that is, that the graph 


E (f,9,1) (A XC B) x E txjxe x(™) x yo x wi (31) 


is a definable subobject. But the given arrows f: E > Aandg: EB 
have definable graphs, so that for suitable geometric formulas ¢(z, w) 
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and 7)(y, w) there are equalities as follows which hold between subobjects 
of X™) x W or of Y¥™) x WO: 


(BYP, Ax BBL, XO x OD) = { (2, w) | (2,0) 


(32a) 
(B22, Bx B22 VO) x WO) = { (y,w) | d(y,w) 
(32b) 
We now claim that 
{(x,y,w) | o(x,w) A dy, w) (33) 


is the graph of ( f,g); i.e., that (31) and (33) are identical subobjects of 
X™) xy) x Ww), Indeed, one readily checks by elementary diagram 
arguments that all squares in the following commutative diagram are 
pullbacks in &: 


ee x Ey) =" yO) XE 
an] [erro [erro 


Ax Ee A Bx ERS AX YOO XE 


(71,972,772) 1xjxl 


‘| lee [boxe 


X™) x By>—_—> X™) x Bx Er X™) xX YO) x pO), 

(1,972,572) 1xjxe (34) 
But, up to permutation of the product factors, the bottom compos- 
ite row is obtained by crossing (32b) with X) while the right-hand 
composite in (34) comes from crossing (32a) with Y). So these 
composites are definable as {(z,y,w) | v(y,w)}” and {(2,y,w) | 
o(z,w)}™, respectively. Their meet (33) therefore is the pullback 
E> X™) x yO) x WM given by the outside square of (34), this is 
exactly (31). This shows that (31) is a definable subobject, as desired, 
and so, at last, completes the proof of the lemma. 

By Lemmas 1 and 2, the category Def(M) thus inherits from € all 
finite limits. It also inherits a (basis for a) Grothendieck topology. In 
€ there is such a basis, where a cover is given by a finite epimorphic 
family. Thus, in Def(M) a finite family 


{ si: (Ai, X) > (B,Y) }2, (35) 
is a cover of the object (B,Y) of Def(M) when this family gives an 


epimorphic family in € under the forgetful functor (16); that is, when 
the induced map [],.A; > B is an epimorphism in €. It follows 
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readily that this indeed defines a basis for a Grothendieck topology on 
the category Def(M). For example, the stability axiom is satisfied on 
the basis of Lemma 2 and the fact (Chapter IV) that pulling back in a 
topos preserves epimorphisms and coproducts. 

One can make this more explicit as follows. Suppose for the ob- 
ject (B, Y) of Def(M) that the subobject B C Y™) is defined by some 
geometric formula 7(y) in the variables y1,..., yz, while each of the sub- 
objects A; C (X)™) js defined by a formula ¢;(z*) = ¢i(xj,...24,), 
for 1 = 1,...,m; furthermore, suppose that the graph S; of the given 
morphism s; is defined by a formula o;(x', y). Then the condition that 
this family (35) of arrows forms a cover in Def(M) is also “definable” 
by a geometric formula. This is stated in the following lemma, which 
will be needed to prove Lemma 6.2 below. 


Lemma 3. The above family (35) of definable arrows (with s; de- 
fined by o; and B by w) is a cover of the object (B,Y) for the indicated 
topology on Def(M) iff the formula 

Vy (W(y) > (Att oy (2",y) V+ V Ae om (2, y))) (36) 


holds for the model M in the topos E. 
Proof: The formula (36) holds in M iff the subobject 


B={y| dy) }¥ CYC) =H x. x XM 
is contained in the subobject 
{y| az? oi (2!,y) V+ V Ae™ om(2™, y) }™. (37) 
Since the graph S; of s, is defined by 
S1 = {(2",y) | or(a",y) 4% C (XM x), 
we know by the formula (6) of §2 for the existential quantifier that 
{y | de" or(x',y) #4 C XO 


is the image of the graph S, C A, x B under the projection A, x B > B. 
But the following commutative diagram 


S\;-———> A, x B 


A, —,,— B 
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shows that this image is exactly the image of the arrow s;: A; — B. The 
corresponding result holds for the graphs S2,...,Sm of the other given 
AITOWS $2,...,3m-. Therefore, by the interpretation of the disjunction 
from §2, the subobject (37) is exactly the supremum 


S = Im(s1) V--- VIm(sm) C B 


of the images of the various arrows s;. But this supremum S can also 
be presented as the image of the arrow A, +--- +A, — B induced on 
this coproduct by the given maps s;. Now this arrow is epi iff its image 
S contains all of B; that is, iff the geometric formula (36) of the lemma 
holds in the model M. 


Given a family of arrows {s;: A; ~ B}?, with a common codomain 
B in a topos €, one would like to construct an arrow f: B > E from 
any “matching” family of arrows f;: A; — E. When this is uniquely 
possible, one says that the given family { s; } is effective. In other words, 
the s; form an effective family when, given f;: A; —~ E fori=1,...,m 
for which all the elongated squares on pairs f,, f, 


Akg Aye Ay 


(38) 


commute, there exists a unique f: B — E, as shown, with fos; = f, 
for i=1,...,m. 


Lemma 4. In a topos € any finite epimorphic family is effective [in 
the sense of (38) above]. 


Proof: By definition, the arrows s; of the epimorphic family to- 
gether determine a single epimorphism s: [[;", A. > B in €. Since 
pullback commutes with sums, first over i and then over j, in a topos 
(Chapter IV, Theorem 7.2), the kernel-pair of this epimorphism s is 
LI; LI; 4: xe A;, with the resulting two maps to J A, as in the diagram 


LIL] 4: xs 4; = [ [4 —> 8. (39) 
t1j=1 i 


But in any topos, an epimorphism is the coequalizer of its kernel pair 
(Theorem IV.7.8), so (39) is a coequalizer. The given arrows f; as in 
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(38) together yield an arrow g: [| A; — E on the coproduct, and the 
commutativity of the outer squares in (38), for each i and j, show that 
this arrow g equalizes the parallel arrows in (39). Therefore, g factors 
as g = fos for a unique f: B — E. By composing with each of the 
coproduct inclusions A; — |] A; the identity g = fos yields the identity 
fos, = fi. 

We will now show that the Grothendieck topology on Def(M) as 
described in Lemma 3 is subcanonical in the sense of Chapter III; 
i.e., that for any object (E,Z) of Def(M) the representable presheaf 
Hom( — , (EZ, Z)) is a sheaf for this topology. 


Lemma 5. The Grothendieck topology on Def(M) is subcanonical. 


Proof: Let (£,Z) be any object from Def(M); so, as before, 
Z=Z,...,Z, is a sequence of sorts while E C Z™) is a subobject 
definable by a geometric formula, say as E = {z | x(z)}™”. To show 
that Hom( — , (£, Z)) is a sheaf, we must consider any covering family 


{ 83: (A,, X*) 3 (B,Y) }%,, (40) 


and show that a matching family of arrows f;: (Ai, X*) > (E, Z) deter- 
mines a unique arrow f: (B,Y) > (E,Z) in Def(M) with fos, = f,. 
Applying the forgetful functor Def(M/) — €, the covering family (40) 
yields an epimorphic family {s,: A, — B}in €, and the arrows f, yield 
a matching family in €. Thus the previous lemma shows that there is a 
unique arrow f: B > E in E€ such that fs, = f, for alli=1,...,m. It 
remains to be verified that this f also gives an arrow (B,Y) — (E,Z) 
in Def(M); in other words, that the graph of f is a definable subobject 
of ¥) x 7), To this end, let s,, A,, and B be defined by geometric 
formulas 


o,(z",y), o.(x’), vy) 


as before, and let f, be defined by some formula 7,(2’, z) fori = 1,...,m. 
We then claim that the unique arrow f is defined by the following for- 
mula, with free variables y, z: 
Ba! (o1(2",y) An(z’,z)) Vie V Ae™ (om(2™, y) A tm(2™,2)). (41) 
Indeed, for each index i = 1,...,m, consider the pullback 
S, Xa, &; ——— F, ——— E 


|} 


Be-——__ §, ——— A, 


5. Syntactic Sites 553 


of the graphs S; and F; of the given arrows s; and f,;. By the rule of §2 
for the existential quantifier, the subobject 


{ (y,2) | Sa" (o;(x*, y) A 7(x",z)) }IMCBxE 


is the image of S; x4, F; ~ B x E. But in the pullback square of the 
diagram just above, all the arrows are isomorphisms since F; and S; are 
graphs. Hence the image above is the same as the image of (s,, fi): A, > 
B x E suggested by the outer sides of the diagram. Moreover, for each 
index i the diagram 


A, ————> B 


aN Jan 


BxekE 


commutes, because fs; = f;, and this shows that Im(s;,f;) C 
Graph(f) = ((1,f): BB x E). Since [[ A; — B is epi, it follows 
that 

Im(s1, fi) V--: VIm(8m, fm) = Graph(f). 


This means that the formula (41) above, which defines the sup of the 
images Im(s;, f;), also defines the (graph of the) arrow f, as required. 
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Let T be a fixed geometric theory in a language L. In this sec- 
tion, we will construct a “syntactic” category B(T) equipped with a 
Grothendieck topology J(T). This category B(T) will be analogous to 
the category Def(M) of definable objects and arrows which was con- 
structed in the previous section from a model M in a topos E, while the 
Grothendieck topology J(T) on this category B(T) will be analogous 
to the topology on Def(M) which was given by epimorphic families of 
definable arrows, as in Lemma 4.3. However, unlike the case of Def(M), 
the construction of the category B(T) will not depend on any one specif- 
ically given model M in a topos €; rather, it will take all models M of 
T in all topoi € into account. (The meaning of “all” used here will be 
considered more closely in the remark at the end of this section.) 

An object of B(T) is thus given by sorts X),...,X, of the language 
and an equivalence class of geometric formulas 


[o(@1,...,2n)l, (1) 
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where ¢(z),...,%n) is a geometric formula with free variables among 
the 21,...2, of sorts X1,...,X,, respectively. Two such formulas 
$(21,...,2_) and @/(x},...,@,) are equivalent [i.e., they define the 
same object of B(T)] , when they both have their free variables among 
L1,..-,;Ln respectively z},...,x/, from the same list of sorts X1,...,Xn, 
and, moreover, when for any model M of T in any topos €, they define 
the same subobject of Xoo Xe X x™), in other words, 


{x | b(z)}™ = {a" | g(a’) }™ (2) 
must hold as an equality as subobjects of xe Xree X x, We 
will denote such an object of B(T) by [¢, X], where ¢ is the geometric 
formula and X is the associated list of sorts. 

To define the arrows in B(T), suppose given two objects [¢, X] and 
[w, Y] of B(T). Here we may assume that all the free variables 71,..., 2n 
occurring in ¢ are disjoint from all those y,,...,y% occurring in w, for 
@(21,...,£,) can always be replaced by a suitable alphabetical variant 
o(xr},...,2/,). Now arrows [¢, X] — [W, Y] are again equivalence classes 
[o;X,Y] of certain geometric formulas o(21,...,2p,Y1,---;Ym), with 
free variables among x; of sort X; and y; of sort Y;. In each model M 
of the theory T in a topos €, such a formula o(z, y) defines a subobject 


{(x,y) | o(a,y)}¥ XO) x VO (3) 


(where X™) stands for X{™)x...x KX)  ete., as before). By definition, 
this formula o(z,y) represents an arrow [o; X,Y]: [¢, X] — [W,Y] in 
B(T) when in every model M of T in any topos €, the subobject (3) is 
the graph of an arrow {x | ¢(x)}™“ — {y| v(y) }“. [So, in particular, 
this subobject {(z,y) | (x,y) } must be contained in the product 
subobject { x | d(x) }” x {y| v(y) }% of XM x Y) ] Furthermore, 
two such formulas define the same arrow in B(T), when in each model 
M of T in a topos €, these formulas define the graph of the same arrow 
{x | (x) }¥ > {y| vy) }™. 

To see that these definitions provide the objects and arrows of a cat- 
egory B(T), we must describe the identity arrows and the composition 
of arrows. For an object [¢, X] of B(T), the identity arrow is represented 
by the formula 


p(z, x’) = (o(z) A o(2’) At = Ty Nv Ain = Tn), (4) 


where x/,...,27, are new variables of the same sorts as the x;, exactly 
as in §4(7). For this p and any model M of T in a topos €, the object 
{ (x, x") | p(x, x") }™ is the graph of the identity arrow on {z | ¢(x) }¥. 
Thus, p(x, 2’) of (4) does define an arrow [¢, X] > [¢, X"] = [¢, X] in 
B(T). 
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As for composition, consider two arrows 
[o; X,Y]: [9X] >, Y], [5¥2Z]: WY) x, 2] 


for which [W(y), Y] = [v’(y’), Y]. Thus, in any model T the formulas 
w(y1,-+.5Yn) and w’(y},...,y/,) define the same subobject. After re- 
placing the variables in the sequence z occurring in r(y’, z) and y(z) by 
different ones, if necessary, all distinct from the variables in the sequence 
y, the formula 7(y,z) also defines an arrow [W,Y] = [W’, Y] — [y, Z]. 
Thus, in defining the composition of [o] and [7] we may assume that 7)(y) 
and y'(y’) are actually identical. The composite arrow [¢, X] — [x,Z] 
is then defined to be that represented by the formula 


Ay (a(x, y) A rly, 2)), (5) 


exactly as in §4(8). Indeed, we showed there that for any model M 
of T in any topos €, if o(2,y) and r(y,z) define the graphs of arrows 
{x | o(2)}” > fy | d(y) }” and {y | vy) }% — {2 | x(z) }™, then 
the formula (5) = §4(8) defines their composition in €. It follows that 
this formula represents an arrow [¢, X] — [x, Z] in B(T), and that this 
arrow depends only on the equivalence classes [o] and [7], and not on the 
particular formulas o and 7 which represent them. This composition is 
clearly associative, so that B(T) is a well-defined category. 

It also follows immediately from the discussion in §4 for any specific 
model M of the theory T in a topos € that there is an evident functor 


Fy: B(T) > Def(M), (6) 
defined on objects by 
Fr (Ip, X]) = {2 | o(2) CX, (7) 


as for arrows, an arrow [o;X,Y]: [¢,Y] — [w,Y] in B(T) defines the 
graph { (x,y) | o(z,y)}™ of an arrow {x | d(x) }¥ > {y | o(y)}¥ 
in €, and we define Fy¢([o(x,y)]) to be this arrow. Thus Fy is indeed 
a functor; ie., it preserves identities and composition, as is immediate 
from the explicit description above of identities and composition in B(T) 
and in §4 for the category Def(M). 

Notice also that these functors Fy,, for all models M of T in topoi, 
are jointly injective, in the sense that [¢, X] = [¢’, X’] as objects of B(T) 
iff for any model M of T in any topos, Fy4([¢, X]) = Fa ((¢’, X’]) (as 
subobjects of X(“)). The same applies to the arrows of B(T); so that 
the functors Fy for all models M are jointly faithful. 


Lemma 1. The syntactic category B(T) has all finite limits; more- 
over, for each model M of T in a topos €, the corresponding functor 
Fy: B(L) — Def(M) is left exact. 
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Proof: The construction of limits in B(T) is the same as that of 
limits in Lemmas 4.1 and 4.2 in the category Def(M) of definable objects 
of a given model M. 

The terminal object of B(T) is given by the identically true formula 
T (with no free variables). Indeed, in any model M of T in a topos E, the 
definable object { - | T }™ is the terminal object 1 of €; and for any other 
definable object [¢, X] of B(T) represented by a formula ¢(21,...,2n), 
the graph of the unique arrow {x | ¢(x)}“ 4 1= {-| T}” in € is 
definable by the same formula ¢(x). So [¢, X] is also the unique arrow 
[, X] > [T] in B(T). 

As for pullbacks in B(T), given arrows 


[o; X, 2]: [d, X] > [x, 2] — WY] [7 ¥, 2] 


in B(T), their pullback is the object of B(T) represented by the formula 
(x) Aw(y) A Az (o(z,z) A r(y,z)), and the projections are represented 
by the same formulas representing the projections 7 and 72 in the proof 
of Lemma 4.2. Just as for the case of the terminal object, it follows that 
these formulas give the pullback in B(T), since for each model M of T 
in a topos €, the same formulas give the pullback in Def(M). 

Finally, it is evident from the descriptions of limits in B(T), and 
from the corresponding explicit descriptions of limits in Def(/) given in 
the previous section, that each functor Fy: B(T) — Def(M) preserves 
finite limits. 


We now define a basis for a Grothendieck topology J(T) on this syn- 
tactic category B(T), as follows. A finite family {s;: A, ~ B}™, of 
arrows in B(T) is said to be a cover in B(T) when, for any model M of 
the theory T in any topos €, the corresponding functor Fy: B(T) > 
Def(M) sends this family to a cover in Def(M) as described in §4; 
equivalently, when the functor B(T) — € obtained by composing Fy 
with the forgetful functor Def(M) — € sends this family to an epimor- 
phic family in the topos €. We observe that J(Z) is indeed a basis for a 
Grothendieck topology on B(T): the transitivity axiom holds since the 
composition of epimorphic families is again epimorphic, while the sta- 
bility axiom holds, since the functor B(T) — Def(M) — € is left exact 
(Lemmas 4.1, 4.2, and 5.1), while the pullback of an epimorphic family 
in € is again epimorphic (because pulling back in a topos preserves epis 
and sums, see Theorem IV.7.2). 

Observe that if for ¢ = 1,...,n the arrow s;: A; — B in B(T) is 
represented by formulas o;(x’,y) for s;, ¢:i(z*) for A; and wy) for B, 
then by Lemma 4.3 the family { s;: A; ~ B}2., is a cover in the basis 
for the Grothendieck topology J(T) on B(T) iff in any model M of the 
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theory T in any topos €, the formula 


Vy (wy) > \V 3x* di(2*,y)) 


t=1 


holds. 
The following lemma is immediate from the definitions of the covers 
in Def(M) and B(T): 


Lemma 2. For any model M of the theory T in a topos €, the 
corresponding left exact functor Fy: B(T) — Def(M) preserves covers. 


By composing with the forgetful functor Def(M7) — €, which is 
left-exact and sends covers to epimorphic families, we thus obtain: 


Corollary 3. The composite functor B(T) — Def(M) — € is left 
exact and continuous. 


Furthermore, analogous to Lemma 4.4, we have 


Lemma 4. The Grothendieck topology J(T) on the syntactic cat- 
egory B(T) is subcanonical. 


Proof: Consider any covering family in B(T), say 
[o.;X*, Y]: [$i X"] > [y¥,Y], t=1,...,n, (8) 
as above. For any given family of arrows 
[713 X*, Z]: (61, X*] — [x,Z], i=1,...,n (9) 
in B(T) which match, in the sense that for any i and j the diagram 
[bis X*] X{x,2] [by X7] —— 3, X"] 
| fra (10) 
[di x] Ix; Z| 
commutes, we have to show, as in §4(40), that there exists a unique 
arrow [p;Y,Z]: [¥,Y] — [x,2Z] in B(T) such that [9]  [o,] = [7] for 
each i = 1,...,n. Consider any model M of the theory T in a topos 


€. By definition of the Grothendieck topology on B(T), the functor 
Fy: B(T) — Def(M) sends the covering family (8) to a covering family 


[743X*,Z] 


si: {2] o(")}™¥ > {y| vy) }% 9 (= 1,...,n) (11) 
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in Def(M), where we have written s; for the arrow (whose graph is) 
defined by the formula o;. Furthermore, the same functor Fy sends the 
matching family of arrows (9) to a matching family of arrows in Def(M), 


ti: {2° | bi(2*) }™ 5 {2 | x(z) }¥ 


which again match, as shown by the commutative squares in Def(M/) 
obtained by applying the functor Fy to the squares in (10). Thus, by 
Lemma 4.4, there is a unique arrow in Def(M) (or in €), 


wu {y| by) }Y = {2| x2)", 


such that wos, = t; for each i= 1,...,n, and this arrow is defined by the 
formula (41) there. This formula is independent of the specific model M. 
Thus, if we let p(y, z) be this formula, it follows that [p;Y,Z]: [v, Y] - 
[x, Z] is the desired unique arrow in B(T) such that [p] 0 [o;] = [7:] for 
each i= 1,...,n. 

Finally, we conclude this section with a remark concerning founda- 
tional aspects of our construction of the category B(T). Strictly con- 
strued, on the basis of the usual axioms for set theory, this construction 
does not produce a small category B(T), because it refers to all models 
in all topoi. This quantification should be replaced by some quantifica- 
tion restricted to a class. 

This can be arranged in several ways. 

For example, instead of considering all] models M in all topoi €, 
one could restrict the consideration to those topoi defined in Zermelo— 
Freenkel set theory by a formula with only set-parameters—no class pa- 
rameters. The indicated collection of all such topoi € does contain all 
Grothendieck topoi since these are definable in terms of their sites, which 
can then be regarded as the set parameters. This collection also contains 
all other topoi that occur explicitly in this book or, indeed, that occur 
in ordinary mathematical practice. 

Another approach to foundational issues is to work in a set-theory 
with an adequate supply of “universes”, in the usage of Grothendieck & 
Co. Then our B(T) lives happily in some higher universe. 

For the category B(T), the equivalence relation on formulas used to 
define the objects and arrows can, in fact, be described by considering 
only the models of the theory T in Sets. This will be proved in Corol- 
lary 7.2 below. However, the proof of this theorem uses the more liberal 
construction given above for the sets B(T). 

There is a more constructive approach: axiomatize the notion of 
“truth” in all T-models in all topoi. Indeed, one may readily spec- 
ify Gentzen-style derivation rules, stating when a formula of the form 
Va (d(x) > Y(2x)) as in §3(6) is provable in a geometric theory T. One 
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then constructs a category Bg(T) like B(T), but with the equivalence 
relation used to define objects and arrows formulated in terms of prov- 
ability. Then “provable” replaces “true in all models in all topoi”. A 
“soundness” theorem—provable implies true in each model—will then 
provide results for the Bg(T) similar to Theorem 1 of §6 below. 


6. The Classifying Topos of a Geometric Theory 

Let T as before be any geometric theory in a language L. In this 
section, we will construct a classifying topos for T-models. The con- 
struction will use the category Mod(T, €) of T-models (§2) in any topos 
€, and the construction in the previous section of the syntactic cat- 
egory B(T), equipped with its Grothendieck topology J(T). Write 
B(T) = Sh(B(T), J(T)) for the topos of sheaves on B(T) with respect 
to this topology. 


Theorem 1. The topos B(T) is a classifying topos for models of T. 


More explicitly, this theorem states that for any cocomplete topos 
€, there is an equivalence of categories 


Hom(€, B(T)) = Mod(T, €), (1) 


natural in €. 

The proof of this theorem will apply the results of Chapter VII. 
Since B(T) is a category with finite limits (as constructed in §5), ge- 
ometric morphisms f: € — B(T) correspond to left-exact continuous 
functors A: B(T) — E, by Corollary VIL9.4. Thus, if ConLex(B(T), €) 
denotes the category of left-exact continuous functors, Theorem 1 may 
be rephrased as: For any cocomplete topos €, there is an equivalence of 
categories 


ConLex(B(T), €) & Mod(T, €), (2) 


natural in €. We will prove this by giving an explicit construction of 
a T-model My, in the topos € from any continuous left-exact functor 
A: B(T) — €, and conversely, of such a functor Ay: B(T) > € from 
any given T-model M in €. 

The second construction from M will use the category Def(M) 
of definable objects and arrows in €, as well as the canonical functor 
Fy: B(T) — Def(M) from the previous section, and the forgetful func- 
tor Def(M) — € of §4(16). Write 


A= Ay: B(T) — Def(M) 3 € (3) 
for the composite of these two functors. By Corollary 5.3, Ay, is left- 


exact and continuous. By this definition (3) any object [d(x), X] of the 
syntactic category B(T), as given by a formula ¢(x), has 


Am (Id, X]) = {a | d(x) }™. (4) 
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Let H: M — M' be a homomorphism between two models M and 
M’ of T in €. By induction on the construction of the geometric formula 
¢(z) one shows that this H: X™) — x( maps {« | (x) }™ into 
{x | (xz) }™’, as in the commutative diagram 


{z | d(z) 1k oe al Kee X XO) =~ X(M) 


: | 
{ x | p(x) $$ {9 x x XD = XM, 


The dotted arrows on the left, present for all geometric formulas ¢, 
constitute a natural transformation Ay — Ayq of functors from B(T) 
to €. Thus, the assignment M+ Ay, is a functor 


Mod(T, €) > ConLex(B(T), €). (5) 


This gives one direction of the desired equivalence (2) of categories. 

In the reverse direction, we are to associate with each continuous left- 
exact functor A: B(T) > € a suitable model M = M, of the theory T 
in the topos €. For each sort X; of the language L we use the formula 
x; = x; for a variable x; of the sort to define the object 


XM = Aoi = 24, Xi) (6) 
of €. For each relation symbol R C X, x... x X, of L there is a corre- 


sponding formula R(x) = R(21,...,¢7) and thus an object [R(x), X] in 
the syntactic category B(T); for the corresponding object in € we set 


R(™4) = A((R(2x), X]). (7) 


For a sequence of variables x = (21,...,%,), the formula x = z is to 
be understood as an abbreviation of the conjunction ©; = 21 A:-+Aadpy = 
Ln. If as before X; denotes the sort of the variable x; and X the sequence 
of sorts X),...,Xp», then in the syntactic category B(T) the object 
[x = x, X] is the product of the various objects [x; = 2;, X;] [as follows 
from the explicit description of limits in B(T) from §5]; thus, 


[ew = 2, X] = [vy = 21 A-+-A Sn = In, X] 


Sa, S21, 2X1] <4. [Sp = CaiAal- 


(8) 
Thus for the left-exact functor A we have 


Xa) = XM) x 1. x X(Ma) (by convention) 
= A([x1 = r1,X,]) x +++ x A([an = Zn, Xn)) [by definition (6)] 
= A([x = x, X]) (by left-exactness of A 
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so the left-exactness of A with (7) yields a monomorphism 
ROMA), X(MA) yy X(Ma) = X (Ma), (9) 


Finally, if f: X, x ---x X, — Y is a function symbol of the language L, 
then in any model M the graph of the corresponding arrow f™): X an x 
-.+x Xi“) _, YD ig defined by the formula “f(x1,...,2,) = y”, which 
we abbreviate as “f(x) = y”. Thus in B(T) there is an arrow 


(f(z) a yl: ({z1 = az, X,]x- (tn = Ln, Xp]) = [x = x, X] 7, ly =yY 
We define f(4) in E to be the image of this arrow under the functor A: 
FOE) = A([F(a) = wf): XG) x x XIN) OA), (10) 


This definition also includes the case of constants of L, which may be 
regarded as function symbols with no arguments. This completes the 
definition of the interpretation M, of the language L in the topos €. It 
is readily seen to be functorial in A. The following lemma will imply 
that this M4 is indeed a model of the theory T. 


Lemma 2. Let M, be the model in E associated with a left- 
exact continuous functor A: B(T) — €. For any geometric formula 


$(21,...,%,), with free variables x = (x1,...,2n) of the respective sorts 
X\,...,Xn, there is a natural isomorphism 
{x | d(x) }%™ = A([¢(2)]) (11) 


of subobjects of X(M4) x ... x xe 


Proof: The proof proceeds by induction on the construction of the 
geometric formula ¢, using both the left-exactness and the continuity of 
the functor A. 

First of all, since A preserves products, there is for each sequence 
£ = (£)..., 2p) of variables of sorts X,..., Xp an isomorphism, where 
“r= x2” stands for “xz; = 2) A+++ ALn = In”: 


A([e = 2, X]) © A((er = 21, Xa]) x ++ x A((tn = en, Xn) 


a x(a) y 0. x K(Ma), (12) 
as stated above. Now, if t(2,,..., 2) is a term of the language L of sort 
Y, with free variables x,,..., 2, of sorts X,,..., Xn, then for any model 


M in any topos, the graph of the arrow t(™): xi) x XM) _, y(M) 
[see §2(1) ] is defined by the formula t(1,...,2%n) = y, or t(z) = y. So 
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this formula defines an arrow [t(x) = y]: [x = 2,X] > [y = y,Y] in 
B(T), and one can prove the following equality of arrows in E: 


A([e(x) = y]) = t0!4): XM x oe x KMa) _, y(Ma), (13) 


Indeed, if t is a single function symbol f(r1,...,2n), this holds by def- 
inition (10), and the general case follows readily by induction on the 
construction of the term ¢. 

Next consider the case where ¢ in (11) is an atomic formula of the 
form R(ti(z),...,tm(x)) for terms t,,...,¢, and a relation symbol R. 
Notice first that, by definition, there is for any model M in any topos a 
description of {--- | R}™ by pullback as in §2(4). From this it follows 
readily that 


[R(ti(x),-..,tm(x)), X] —————> [R(y), Y] 


| | ws 


[c = x, X] y=4¥,¥] 


is a pullback in B(T). Since the functor A: B(T) — € is left-exact, it 
sends this pullback (14) to a pullback in €. But by the definition of the 
model My [as in (6)-(10)], we have A((a = a]) = X(“4 x ... x xf) 
and A([y = yj) = YO“) x.» x YAM), while A([R(y)]) = ROM) = 
{y | R(y)}@). So compare the pullback in € obtained by ap- 
plying A to (14) with the pullback §2(4) for the special case where 
M = My. We then find that A([R(ti(x),...,tm(x)),X]) = {a | 
R(ti(z),.-.,tm(z)) }44). This shows that the identity (11) holds for 
the atomic formula R(t (z),...,tm(x)). Other types of atomic formulas, 
T (“true”), L (“false”), and equalities t(z) = t’(x) between terms t = t’, 
are treated similarly. 

Next, suppose the lemma holds for two formulas ¢(x) and w(x), with 
free variables x),...,2n of sorts X),...,X,. We then wish to show that 
it holds for the conjunction ¢(x) A w(x). First, for any model M in any 
topos F, the definition of conjunction gives a pullback in that topos F 
of the form 


——>+ 
[ta (z)=y1A--Atm(z)=¥m] 


{x | d(x) A p(x) > {2 | o(z) }¥ 


| si | (15) 


{x | b(x) #4 {a | a= 2}! = (xf) x x XM). 


It then follows from the construction of limits in Lemma 5.1 that the 
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similar square in B(T) is a pullback: 


[d(z) A d(x), X]—> [4 (2), X] 


| po | (16) 


[(x), X]~_———— [x = 2, X]. 


Since the functor A: B(T) — € is left-exact, we thus obtain a pullback 
in €: 
A([6(x) A (2), X])———> A([(2), X]) 


| a | (17) 


A([y(z), X])»——__> A([z = £, X)). 


But by definition of the model M4, A([a = x, X]) = X(“4) x. +. X(M4), 
while by induction hypotheses on ¢ and wy there are isomorphisms 
A([o(a), X]) & {x | d(x) }™4) for ¢ and similarly for Y. Thus by com- 
paring the pullback (17) with the special case of (15) where M = Ma, 
we find that 


A([$(z) A d(x), X]) = {x | (2) Ad(2) }%™, 


so the lemma holds for the conjunction ¢ A y. 

Next, still assuming that the lemma holds for the formulas ¢ and yw, 
consider their disjunction ¢(r) V y(x). We will unwind the definitions 
to show that the two arrows 


[o(z), X] [d(x) V ¥(z),X] and [p(x), X] >[¢(z) V ¥(z), X] 
form a cover for the topology J on B(T). By the definition of J this is 
the case when for all M the arrows (in an alphabetic variant) 


{x" | $(2*)}¥ = {y| dy) Voy), 
{ x? | b(2*)}”¥ = {y| d(y) V v(y) }” 


form a cover in the topology of Def(M/). Here the first arrow is defined 
by a formula (notation as in Lemma 4.5) 


oi(z",y) = (2! = yA g(x) A (Oly) V o(y))), 


and similarly for the second. Now apply Lemma 4.3; these arrows cover 
if the formula 


Vy ([e(y) V b(y)] > Aa? [at = y A G(2') A (dy) V ¥(y))] 
Vaa? [2 = y A o(x?) A ((y) V b(y)))) 
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1 and 2%, this is evident, 


holds in M. Substituting y for the required x 
so we have the asserted cover for J. 

Since the functor A is continuous for covers, it follows that 
A((o(z), X]) and A((y(x), X]) form a cover of A((6(x) V(x), X]). Thus, 


as subobjects of Xa) , 


A([9(z), X]) V A([b (x), X]) = A([d(@) Vv o(a), X])- 


But, by the induction hypothesis, A({¢(z), X]) = {2 | ¢(z) }4), and 
similarly for 7, while by definition of the interpretation of disjunction 


{x | b(x) V (2) }™ = {2 | o(2) } v {2 | o(a) }O™. 


Thus, A([¢(z) V ¥(z),X]) = {2 | o(z) V d(e)}%), and the lemma 
holds for the disjunction ¢ V y. 

Finally, the case of an existential quantifier follows in much the same 
way from the continuity of the functor A, noting that for a formula 
o(x,y) the evident projection arrow [¢(2,y), X,Y] — [By d(z, y), X] in 
B(T) is a covering arrow. 

Since every geometric formula is built up from atomic formulas using 
conjunction, disjunction, and existential quantification, it follows that 
(11) holds for all geometric formulas, so the lemma is proved. 


Lemma 3. For any geometric theory T and any continuous left- 
exact functor A: B(T) — € the associated interpretation M4 of L in € 
is a model of T. 


Proof: In a geometric theory T, each axiom has the form 


(Va)((x) > p(z)), 


where ¢ and w are geometric formulas. Hence in every model M of 
T we have an inclusion {x | ¢(z)}™“ C {x | o(x)}™, and hence a 
corresponding inclusion 


[6, X] [y, X] 


by the definition of the syntactic category B(T). The left-exact functor 
A sends this monomorphism to an inclusion of subobjects 


Ald, X] < Aly, X] 
of the object A([z = 2]) = X™4). By Lemma 2, this means that 
{x | d(x) }%4 < {| v(x) }%. 
Hence the axiom Vz (¢(x) — w(z)) of T is valid in the model Ma. 
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Lemma 4. The two constructions, M +> Ay of a continuous left- 
exact functor Ay: B(T) — € from a model M in €, and At> Mg of 
a model Ma from a left-exact continuous functor A: B(T) — €, are 
mutually inverse up to natural isomorphism. 


Proof: One way around, start with a model M of the theory T in 
a topos €. There is then an associated functor A = Ay: B(T) — € 
defined as in (4) above, and thus a new model My, associated with this 
functor A as in (6)-(10) above. Then for any sort X of the language L, 
and for a variable x of that sort X, 


X(MA) = A([x = 2, X]) [by definition (6)] 
~{z|2=2}™" [by definition of A from M, (4)] 
~ Xx. 


And similarly, for a relation symbol R C X, x --- x X, of the language, 


R™4) = Ay([R(z),X]) [by definition (7)] 
~ {x| R(x)}“ [by definition (4)] 
~ R™) [by definition §2(4)]. 


Finally if f: X, x --- x X, —- Y is a function symbol, 


Graph(f(4)) = { (a,y) | f(x) = y}™ (true in any model) 
= A([f (x) = y, X, Y]) (by Lemma 2) 
= {(z,y) | f(e)=y}" [by definition (4)] 
~ Graph(f™)). 


This shows that the new model M4 is isomorphic to the old one M. 

The other way around, start with a left-exact continuous functor A, 
to get first a model M = My in € and then a new functor Ay. Then 
for an object [¢(x), X] of B(T), 


Am([6(x), X]) = {x | d(x) }™4) [by definition (4)] 
= A(([4(x), X]) (Lemma 2). 


This isomorphism is natural, hence gives an isomorphism of functors 
A=Am. 


This proves the lemma. Now the construction M 1 Ay of a functor 
from a model (5) is functorial and the inverse construction A ++ My, of 


566 X. Geometric Logic and Classifying Topoi 


a model out of a functor is also functorial. It then follows that the two 
constructions establish an equivalence of categories 


Mod(T, €) = ConLex(B(T), €), 


as announced in (2) above. To complete the proof of Theorem 1, it now 
suffices to observe that both constructions are natural in €. Since they 
are mutually inverse by Lemma 4, it suffices in fact to show that one of 
them is natural in €. Consider for instance the construction of functor A 
from a model M in the topos €. Naturality requires, for any geometric 
morphism g: F — € between topoi, that the square 


Mod(T, €) ——~ ConLex(B(T)), €) 
r| [sme with g” (18) 
Mod(T, #) —~— ConLex(B(T), F) 


commutes up to natural isomorphism. But for any model M of T in € 
and for any geometric formula ¢(x), we have 


9° (Am ([6(z),X])) = 9°({2 | o(2) ") [by definition (4)] 
~{xr| d(x) }9™/ (Theorem 3.5) 
= Ag-(m)([¢(x), X]), 


the latter by definition of the functor A,-(,4) from the model g*(M). 
Thus the square (18) commutes up to natural isomorphism. Therefore 
Theorem 1 is established. 


7. Universal Models 


The previous section proved for an arbitrary geometric theory T that 
the topos B(T) of sheaves on the syntactic category B(T) is a classifying 
topos for T-models. Therefore, for any cocomplete topos €, there is an 
equivalence of categories 


Mod(T, €) = Hom(€, B(T)) (1) 


between T-models in € and geometric morphisms € — B(T). As in 
Chapter VIII, this implies that there exists in B(T) a “universal” model 
of T, call it Ur. It is that model in the topos B(T) which corre- 
sponds under the equivalence (1) to the identity geometric morphism 
B(T) — B(T). It is universal, in the sense that any other T-model in 
any (cocomplete) topos € is an inverse image of this particular model. 
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Indeed, recall the argument from §VIII.3: By the naturality of (1) each 
geometric morphism f: € — F gives by (1) a diagram of categories and 
functors 


= 
a 


(T, F) —~— Hom(F, B(T)) 


r| fasmcc2cry (2) 


Mod(T, £) —~— Hom(€, B(T)) 


which commutes up to isomorphism. 

Now any model M of T in € corresponds by the equivalence (1) 
to a geometric morphism cy: € — B(T); so by chasing the identity 
B(T) — B(T) in two ways from upper right to lower left around the 
square (2) for the special case F = B(T) and f = cm, we find that there 
is an isomorphism of T-models ci,(Ur) = M in €. This shows that, 
up to isomorphism, the arbitrary model M is the inverse image of the 
universal model Uy along a suitable geometric morphism, namely the 
morphism cy: € > B(T). 

Let us take a closer look at the equivalence (1). Take a geometric 
morphism f: € — B(T) as on the right of (1). Its inverse image func- 
tor f*, composed with the Yoneda embedding y of B(T) followed by 
sheafification a for the Grothendieck topology J(T) of §5, gives the left 
exact continuous functor 


A= f*oaoy: B(T)-€ 


as in the equivalence of Corollary VII.9.4. In turn, such functors 
A: B(T) — € correspond to T-models Mz, in €, as described explic- 
itly in the previous section. For any geometric formula ¢(x) subobjects 
in such a model M, are related to the functor A by the natural isomor- 
phism 

{x | p(x) }% = A((d, X]) (3) 


of Lemma 5.2. But by Lemma 5.4, the topology on B(T) is subcanonical, 
which means that the Yoneda embedding already sends objects of B(T) 
to sheaves; so y = ay: B(T) — B(T). Thus, as a special case of the 
equivalence (1), the universal model Up in B(T) corresponds initially to 
the identity B(T) — B(T) and then, by the equivalence between models 
and left exact functors, to the Yoneda embedding y: B(T) — B(T) 
itself. Therefore, as a special case of (3), the universal model Up has 
the property 

{x | 6a) 7? yl, X) (4) 


for every geometric formula ¢(z). 
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Theorem 1. Let T be a geometric theory. For any two geometric 
formulas ¢(x) and w(x), the formula Vx (g(x) — (x)) holds in the 
universal model Ur in the classifying topos B(T) iff this formula holds 
in every model M of T in every topos €. 


Thus the universal model U7 is in this sense a minimal model of the 
theory T. 


Proof: The “if” part of the theorem is, of course, clear. For the 
converse, suppose Vz (¢(x) — y(x)) holds in U7, so that we have an 
inclusion {x | (xz) }U7) < {ax | W(x) }7) of subobjects in the topos 
B(T). So in that topos, there is an arrow (dotted below) for which the 
following diagram commutes 


{x | (x) }UT) ------------------------------ > {a | p(x) }Ur) 


ue (5) 


By (4) above, all the objects in this diagram are representable. Since the 
topology on B(T) is subcanonical (Lemma 5.4), the Yoneda embedding 
is a full and faithful functor y: B(T) — B(T). Thus, a dotted arrow in 
B(T) exists, as in (5), iff a dotted arrow exists which makes the following 
diagram commute in B(T): 


But for any model M in any topos € (not necessarily cocomplete), the 
functor Fy: B(T) — Def(M) of §5(6), (7) sends the diagram (6) above 
to a commutative diagram 


{x | d(x) }™ 


wa (7) 


Kw 


x) KX... XM), 


{x | (x) }™ 


of definable objects and arrows in €. Thus { x | #(x) }™ < {a | w(x) }¥ 
in €, and hence Vz (¢(x) — #(x)) holds for the model M in €. 


Combined with Deligne’s theorem, this gives a surprising result: 
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Corollary 2. Let T be a geometric theory. A formula Vz (¢(x) > 
w(x)) as above holds in all models of T in any topos, iff it holds in all 
models of T in the topos Sets. 


Proof: The “only if” part is clear. For the converse, suppose 
Va (d(x) — o(x)) holds in any model M of T in Sets. By Theorem 1, it 
suffices to show that it holds in the universal model U7 in the classifying 
topos B(T). But the site (B(T), B(T)) for the topos B(T) is given by 
a category B(T) with finite limits and a Grothendieck topology given 
by finite covering families (§5); in other words, this site is of finite type, 
so that the classifying topos B(T) is a coherent topos (§IX.11). By 
Deligne’s theorem (Corollary [X.11.3), B(T) has enough points. Con- 
sider any such point p: Sets — B(T). Then M = p*(U7) is a model of 
the theory T; hence, since Vz (¢(x) — 7(zx)) is assumed to hold in all 
models in Sets, we have { x | d(x) }¥ < {z| (x) }”. By Theorem 3.5 
and the definition of M, this inclusion of subobjects may equivalently 
be written as 


p*({2 | (2) }97) < p*({2 | ¥(e) }"*). 


Since this holds for any point p: Sets — €, and since B(T) has enough 
points by Deligne’s theorem, we conclude by [X.11(5) that 


{x | (x) }°? < {x | (x) }¥". 


This shows that Vz (¢(r) — w(z)) holds in the universal model U7, as 
desired. 


Corollary 2 is very useful in practice. For example, many properties 
of rings and modules in homological algebra can be expressed in the form 
Vx (¢(r) — (x)), where ¢ and w are geometric formulas in a suitable 
language. Hence, to verify whether such properties hold for all rings, 
modules, etc., in a topos, it is enough to verify whether they hold for 
“ordinary” rings, modules, in the classical topos Sets. 


Exercises 


1. With the notation of §5, show that for any geometric theory T, 
the product of two objects [X; ¢(x)] and [Y; ~(y)] in the category 
Def(M) is given by [X, Y; ¢(x) \w(y)]. Also show that the equal- 
izer of two maps [0(z, y)] and [7(z,y)]: [X; o(2)] — [Y;(y)] is 
given by [X; (¢(x) A Sy(0(z,y) A7(z,y))). 

2. Show that classifying topoi are unique; that is, if for a geometric 
theory T both B(T) and B’(T) are topoi for which there is a 
natural equivalence as in §6(1), then B(T) is equivalent to B’(T). 
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X. Geometric Logic and Classifying Topoi 


3. Let B(T) be the classifying topos for a geometric theory T, with 


universal model Ur in B(T) as in §6. Let ¢(x) be a geometric 
formula in variables x = (21,...,2n) and A = {x | ¢(x) }47 
the corresponding object of B(T). Show that the slice category 
B(T)/A is the classifying topos for the geometric theory obtained 
from T by adding a new sequence of constants c = (c),...,¢€n) to 
the language (where c; is of the same sort as a variable z; in the 
sequence x), and one new axiom ¢(c). (Hint: see Exercise VIII.6.) 


. Let E = Sh(C, J) be a coherent topos, where (C, J) is a site of 


finite type, as in §IX.11. Describe a geometric theory T such that 
for any topos Ff, there is a natural equivalence between models M@ 
of T in F and left exact (or flat) continuous functors A: C > F. 
(Hint: take a language with all objects of C as sorts and all 
arrows of C as function symbols.) Conclude that € is equivalent 
to the classifying topos B(T). (Thus, every coherent topos is the 
classifying topos of some geometric theory.) 


. Let S be the Sierpinski space {0,1} (with {1} but not {0} an 


open subset of S). 


(a) Observe that the topos Sh(S) of sheaves is the functor 
category Sets? (here 2 is the poset 0 < 1, viewed as a 
category). More generally, show that for any Grothendieck 
topos €, the product Sh(S) x € as in Exercise VII.13 is 
(equivalent to) the arrow-category E?. 

(b) For any geometric theory T, describe explicitly a new ge- 
ometric theory T’ such that for any topos €, a model of 
T’ in € is a homomorphism of T-models in €; observe also 
that this is the same thing as a model of T in €?. 

(c) Conclude that for any coherent topos ¥, the exponential 
topos F**(5) exists, and is again coherent; formulate this 
as an equivalence of categories between geometric mor- 
phisms € — F55(5) and Sh(S) x € — F. (Use Exercise 4.) 


. This exercise is related to Exercise VIII.8. Let T be the geometric 


theory, in the language with one binary relation symbol D (for 
“distinct”), given by the following axioms: (T — dr(x = z)), 
Vr, y(x£ =yV D(z, y)), Vz, y(x = yA D(z, y) > L), and for each 
n > 1 the axiom Vz1,...Vtn(T > Sy(D(ai,y) A+: A Dan, y))). 
Show that a model of T in a topos is exactly an infinite decidable 
object. Thus the classifying topos must be equivalent to the topos 
B(Aut(N)) of continuous Aut(N)-sets. Prove this equivalence 
directly, by using the Appendix and 8III.9. 


. A first-order language is said to be propositional if there are no 


sorts. Such a language only has relation symbols R (necessarily 
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taking zero arguments!). As an example, consider the language 
L which has for each finite sequence u of zeroes and ones such 
a relation symbol R,. Let T be the geometric theory in this 
language L with the following axioms (for all binary sequences wu, 
v): 

(Al) T — Rg (where @ is the empty sequence); 

(A2) Ru Ru-o V Ru~i (— for concatenation); 

(A3) R, —- R,, whenever v is an initial segment of u; 

(A4) R, A Ry, — L, whenever u and v are incompatible. 


[Two sequences u = (to,..-,Un—1) and v = (U9,...,Um—1) are 
said to be incompatible if u; 4 v; for some i < n, m.] Show that 
the classifying topos B(T) is equivalent to the topos Sh(2N) of 
sheaves on the Cantor space. (You may wish to use the Appendix 
here.) Conclude from Exercise VIII.10 (or show directly) that 
B(T) classifies maps N — 2. 


Appendix: 
Sites for Topoi 


A Grothendieck topos was defined in Chapter III to be a category of 
sheaves on a small site—or a category equivalent to such a sheaf cate- 
gory. For a given Grothendieck topos € there are many sites C for which 
E will be equivalent to the category of sheaves on C. There is usually 
no way of selecting a best or canonical such site. This Appendix will 
prove a theorem by Giraud, which provides conditions characterizing 
a Grothendieck topos, without any reference to a particular site. Gi- 
raud’s theorem depends on certain exactness properties which hold for 
colimits in any (cocomplete) topos, but not in more general cocomplete 
categories. Given these properties, the construction of a site for topos 
E will use a generating set of objects and the Hom-tensor adjunction. 
The final section of this Appendix will apply Giraud’s theorem to the 
comparison of different sites for a Grothendieck topos. 


1. Exactness Conditions 


In this section we consider a category € which has all finite limits and 
all small colimits. As always, such colimits may be expressed in terms 
of coproducts and coequalizers. We first discuss some special properties 
enjoyed by colimits in a topos. 

Coproducts. Let E = ||, Ex be a coproduct of a family of objects 
E, in €, with the coproduct inclusions i,: Ey — E. This coproduct 
is said to be disjoint when every ig is mono and for every a # £ the 
pullback E, x g Eg is the initial object in €. Thus in the category Sets 
the coproduct, often described as the disjoint union, is disjoint in this 
sense. (On the other hand, in the category of commutative rings the 
coproduct is the tensor product which is not generally disjoint.) 


The coproduct E = J], Ez is said to be stable (under pullback) when 
for every map E’ — E in € the pullbacks E’x gE, — E’ of the coproduct 
inclusions ig: Ey — E yield an isomorphism ][,(E’ xg Eq) & E’. 
Hence, if every coproduct in € is stable, then for each map u: E’ = E 
the pullback functor u*: E/E — €/E' preserves coproducts. Thus, in 
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this case every square of the form 


] [(4e xz E’) —— |] Aa 


is * 


E' —————— E 


is a pullback in €; or, equivalently, the pullback operation — xg E’ is 
distributive over coproducts, as in 


([ 4a) xz E’ =] [Aa xz £). (2) 


Equivalence Relations. An equivalence relation on an object E 
of € is a subobject R C Ex E satisfying the usual axioms for a reflexive, 
symmetric, and transitive relation, as expressed in the appropriate dia- 
grammatic way. If we write (00,01): R— E x E for the monomorphism 
representing the subobject R, these axioms are 


(i) (reflexive) the diagonal A: E — E x E factors through 
(90,0:): RO Ex E; 
(ii) (symmetric) the map (0,,00): R — E x E factors through 
(0,01): RO EXE; 
(iii) if R * R denotes the following pullback 


R*eR—2 >R 


a 


R—z— E, 
1 


then (07, 0172): R* R — E x E factors through R. 


To explain the condition (iii), observe, in the case where € is the category 
of sets, that the pullback R * R is just the set of all those quadruples 
(x,y, 2,w) with (x,y) € R, (z,w) € Rand y = z. So in this case, (iii) 
expresses the usual transitivity condition for an equivalence relation on 
a set. 

The quotient E/R of an equivalence relation R C E x E in E is 
defined, much as in Sets, to be the object E/R given by the following 
coequalizer diagram in € 


R= E—*E/R, (4) 


provided such a coequalizer exists. The coequalizing map g, when it 
exists, is always an epimorphism. 
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If u: E — D is any morphism in €, the kernel pair of wu is a parallel 
pair of arrows 0), 0,: R — E, universal with the property that wd = 
uO,; thus the object R is the pullback of u along itself, as in the diagram 


Re oe 


J ° 


E—,— D. 


It follows readily that (@,0:): R —~ E x E is a monomorphism, and 
an equivalence relation. Such an equivalence relation, arising as the 
kernel pair of some arrow wu, is automatically also the kernel pair of its 
quotient map q: E — E/R when the latter exists. However, the converse 
assertion (every equivalence relation is the kernel pair of some map, or 
equivalently, of its quotient map) holds in Sets, but not always; we will 
need it as a requirement for a Grothendieck topos. 

Similarly, the coequalizer of any parallel pair of arrows is necessarily 
also the coequalizer of its kernel pair, but we must require for a topos 
that every epi is the coequalizer of some parallel pair (or equivalently, 
of its kernel pair). 

A diagram of the form of a “fork” 


R—=3 EQ, (6) 


is said to be exact if g is the coequalizer of 0) and 0,, while these form 
the kernel pair of g. This diagram (6) is stably exact if it remains exact 
after pulling back along any map Q’ — Q in €. This means that the 
diagram 


RxQ Q'S ExgQ’ > (QxqQ) =, (7) 


obtained from (6) by pullback, is again exact. [As in the case of coprod- 
ucts, if every exact diagram (6) in € is stably so, then for any arrow 
u: A — B in € the pullback functor u*: E/B — &/A preserves exact 
forks.] 

A set of objects {C; | i € I} of E is said to generate E when, for 
any two parallel arrows u, v: E — E’ in €, the identity uw = vw for all 
arrows w: C; — E from any object C; implies u = v. Equivalently, this 
means that for each object E, the set of all arrows C; — E (for all i € I) 
is an epimorphic family. Again this means that whenever two parallel 
arrows u, v: E — E’ are different, there is an arrow w: C; — E from 
some C; with uw 4 vw. 

We can now formulate the Giraud theorem for Grothendieck topoi. 
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Theorem 1 (Giraud). A category € with small hom-sets and all 
finite limits is a Grothendieck topos iff it has the following properties: 


(i) € has all small coproducts, and they are disjoint and stable under 
pullback; 

every epimorphism in € is a coequalizer; 

every equivalence relation R= E in € is a kernel pair and has a 
quotient; 

every exact fork R= E — Q is stably exact; 

there is a small set of objects of E which generate E. 


(ii 


Note that (ii) and (iv) together state that for each epimorphism 
B — A, the fork B x4 B= B— Ais stably exact. Also (iii) and (iv) 
together state that each equivalence relation R C E x E has a quotient 
E/R for which the resulting fork RE > E/R is stably exact. 

The necessity of these conditions for a Grothendieck topos is eas- 
ily established. Indeed, properties (i)-(iv) obviously hold for the cat- 
egory of sets. Hence, they are true in the functor category Sets©” 
constructed from any given small category C because limits and col- 
imits in such a functor category are computed pointwise. Finally, 
if J is any Grothendieck topology on such a C, the inclusion func- 
tor Sh(C, J) Sets©”” from the category of all J-sheaves has, as in 
Chapter ITI, a left adjoint, the associated sheaf functor a: SetsC” 
Sh(C, J). The sheaf category Sh(C, J) is closed under finite limits in 
Sets©” , hence has all finite limits. The colimits in Sh(C, J ) are com- 
puted, as in Chapter III, by first constructing the colimit in Sets” 
and then applying the associated sheaf functor a. Since this functor a is 
a left adjoint and is left exact, it must preserve colimits and finite lim- 
its. Hence, Sh(C, J) inherits the exactness properties (i)~(iv) from the 
presheaf category Sets©”. The property (v) requiring generators also 
holds for each Grothendieck topos €, as observed at the end of §II1.6. 

The sufficiency of the conditions of Theorem 1 will be proved at the 
end of §3. 


2. Construction of Coequalizers 
In this short section we will prove 


Proposition 1. Any category € with small hom-sets and all finite 
limits which satisfies conditions (i)—(iv) of the Giraud theorem has all 
coequalizers, hence is cocomplete. 


The following factorization lemma is a first step. 


Lemma 2. Every arrow u: E — A in € can be factored as an epi 
followed by a mono, as in E> B— A. 
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The subobject B is then called the image of u. 


Proof: Take the kernel pair R = E x4 E of u: E — A, with its 
projections 7, and m2: E x4 E — E. Let p: E — E/R be the coequal- 
izer of 7 and 7 [this coequalizer exists by condition (iii) of Giraud’s 
theorem]. Since uz, = um2, we can factor w through this coequalizer p 
as in the commutative diagram 


R= Ex, E===— E——_*—_. A 


V7 


Since p is a coequalizer, it is epi. To prove that the other factor v is 
mono, we use the fact that epimorphisms in € are stable under pullback 
[a consequence of conditions (ii) and (iv) of the theorem]. So, to show 
that v is mono, consider any two parallel arrows f, g: T > E/R in E 
for which vf = vg. Then the pair (f,g): T — (E/R) x (E/R) factors 
through the pullback (F/R) x 4 (F/R). Now p x p is the composite of 
the following two pullbacks of p, 


px p: Ex, E—~2s Ex, (E/R) > (E/R) xa (E/R), 


hence is epi. Therefore so is the pullback q of p x p along (f,g), as in 
the diagram 


ae eee R= SE 


| [pe (2) 


T 75> (E/R) xa (E/R). 

The pullback T’ — E x, E of (f,g) has components f’ and g’ as indi- 
cated, so that 7,0(f',g’) = f’ and 72.0(f',g’) =g’: T’' — E. Therefore 
pf’ = pm (f',9’) = pro(f',g') = pg’ [the middle identity by commuta- 
tivity of (1)]. Thus, by commutativity of the pullback diagram (2), we 
also have fq = pf’ = pg’ = gq. Since q is epi, we conclude that f = g. 
This shows that v is mono, so that u = po v is the desired epi-mono 
factorization of u. 


One readily shows that this factorization is unique, up to isomor- 
phism. Hence the image of u is uniquely defined. 

In the category of sets, the coequalizer of an arbitrary pair u, 
uv: B= A of parallel arrows can be constructed by first taking the equiv- 
alence relation R on the set A generated by u(b) Rv(b) (for all b € B), 
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and then taking the resulting quotient A/R. The same construction 
applies to our category E: 


Proof of Proposition 1: We wish to construct the coequalizer 
of any parallel pair of arrows u, v: B= A in €. Define a sequence of 
subobjects (09, 07): Ry — Ax A (for n = 0,1,2,...), as follows. Ro is 
the diagonal A A x A. The subobject R, is the image of 


B+ B+ A Utowty Ax A: 


this image exists by Lemma 2. For n > 1, Rnp4i is the image of 
(Of, OP t2): Ry * Rn — Ax A, where R, + Ry = Ry Xa Ry isa 
pullback of the form (1.3) above. This gives an increasing sequence 
Ro C Ri C Ro C... of subobjects of A x A. Their union can be con- 
structed as the image of |], Rn — Ax A, call it (09,01): RA A. 
We claim that this R is an equivalence relation, and that its coequal- 
izer g: A — A/R [which exists by condition (iii) of Giraud’s theorem] is 
the coequalizer of the given maps u and v: B=3 A. Assuming, for the 
moment, that R is an equivalence relation, the second assertion follows 
because, for any arrow f: A — X in €, one has fu = fv iff fOo = fO,. 
Indeed, if fOo = fO, then surely fu = fv, since (u,v): B— Ax A fac- 
tors through R, > Ax A, hence through R= Ax A. And conversely, 
if fu = fv, then one easily shows by induction on n that f equalizes OF 
and Of: R, — A, for each n. Thus 


B——— A—*— A/R 
is a coequalizer. It thus remains to be shown that R is an equivalence 
relation. By the definition of Ro, the diagonal A — Ax A factors through 
Ro, hence through R; therefore, R is reflexive. Furthermore, Rg and R, 
are clearly symmetric, while the symmetry of R, readily implies the 
symmetry of Ryi1; hence R is symmetric. Finally, to prove that FR is 


transitive, consider for m and n the following pullback diagram, built 
on the pullbacks like (1.3) for R * S: 


Rp * Rn ———> R * Rn ———> Ron 


| | | 


R, * R—__—> R« R24 R 


rs: 


Ra R a A. 


For transitivity of R, we must show that (0971, 0172): R* RO AXA 
factors through R. Since ][ Ry, — R is epi by construction, while pull- 
back in € preserves coproducts and epis (by the Giraud conditions), it 
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follows that Ln.m RitRm — R*R is epi. Hence, since R,*Rm C Rp* Ry 
where k > m, n, the map |], Rx * Rk — R* R is also epi. So it suffices 
to show for each k that the map (097, 01%2): Ry * Ry — A Xx A fac- 
tors through R. But by construction, this map factors through Rx41, 
hence also through R. We have now completed the proof that R is an 
equivalence relation, thus finishing the proof of Proposition 1. 


Corollary 3. For a category € satisfying the Giraud conditions (i)— 
(iv) as above, and for any category C, if a functor F: € — C preserves 
coproducts, finite limits and coequalizers of equivalence relations, then 
F preserves all coequalizers (and hence all colimits). 


Proof: By hypothesis and condition (ii) on €, the functor F will 
preserve epimorphisms. It follows that F' preserves images. Thus F 
preserves all the constructions in the just completed reduction of arbi- 
trary coequalizers to coequalizers of equivalence relations. Therefore F 
preserves coequalizers. 


3. The Construction of Sites 


We return to the converse part of Giraud’s theorem. By hypothe- 
sis (v), the category € has a small set of generators. Take C C € to be 
the small full subcategory of € with these generators as objects. The 
inclusion functor A: C — € then gives rise to a “Hom-tensor” adjunction 


~ @c A: Sets©” ———= E: Hom, (A, —) (1) 


as in Chapter VII, where the left adjoint is written on the left. Recall 
here that the right adjoint in (1) is defined, for each object E of €, as 
the presheaf on C given for each object C' by 


Hom, (A, E)(C) = Hom¢(A(C), E). (2) 


Also, the left adjoint in (1) is constructed, for each presheaf R on C, as 
a coequalizer of the form [cf. VII.2(11)] 


I] 4c¢)= [] 40— Rec. 


ui OC Cc 
re R(C) re R(C) 


(3) 
Here, as in Chapter VII, these coproducts are taken over the arrows and 
the objects, respectively, of the category of elements of R. 

We wish to show that the functor Hom, (A, — ) in (1) sends objects 
E in E not just into presheaves, but into sheaves for a suitable topology 
J on the small category C, so that the adjunction (1) will restrict to an 
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equivalence of categories Sh(C, J) = €. We define J by specifying that 
a sieve S on an object C of C is to be a cover of C when the arrows 
g: D — C belonging to S together form an epimorphic family in €. 
Equivalently, take these arrows g to be the components of a map on the 
coproduct, 


pss [J] D—->C (4) 
(g: DocC)es 


over S; then S covers C when this map pg is an epi in €. In brief, covers 
are epimorphic families. 

This prescription does indeed define a Grothendieck topology on the 
category C. First, the maximal sieve on each object C’ includes the 
identity C — C, hence is a cover in this sense. The transitivity axiom 
for these covers is evident. To verify the stability axiom, consider the 
pullback of such a covering sieve S along any arrow h: C’ — C in C. 
Then since epis and coproducts are preserved by pullbacks in € [by 
assumptions (i)~(iv) of the theorem], the pullback of the epi (4) along h 
yields the following pullback diagram in € with the top map p epi: 


[] >xco ee 


gES 
| h 
[oe 
ges 


Moreover, the objects of C generate €, so there is for each arrow (g: D > 
C) € S an epimorphic family of arrows { Bf — D xc C'}; with each 
domain BY an object of C. Thus the collection of all the composites 
B? = DxcC' —C’, for all g € S and all indices i, form an epimorphic 
family contained in the sieve h*(S). Therefore h*(S) is a cover, as 
required for stability. This proves that the J as defined above is indeed 
a Grothendieck topology on C. 


Lemma 1. For each object E of € the functor Homg(— , E): C°P > 
Sets is a sheaf for the Grothendieck topology J on C. As a consequence, 
this topology is subcanonical. 


Proof: Consider an object C of C C € and a covering sieve S on C 
with the associated epimorphism (4). By the exactness assumptions on 
E, this epi p is the coequalizer of its kernel pair; that is, the pullback of 
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p along p, as in 


([] 2’) xc ([] 2) —— |]? 
g’ES ges ges 


|] 2’ > C. 


ges 


But since pullback in € preserves coproducts (these are stable under 
pullback), one has 


so the coequalizer has the form 


[]0' xcD—| [9c (5) 


959 g 


Here g and g’ range over all arrows g: D — C and g’: D’ — C in the 
sieve S, while 7, and a2 are the unique arrows from the big coproduct 
such that all squares of the form 


t 2 
D' xq D425 | | D’ xo De £2 D' xo D 
959 


my R2 
O71 o2 


D' [[> - D 


commute. In this figure the horizontal maps i are the various coproduct 
inclusions, while 7, and 72 are the projections of the pullback D’ xc D. 
Now the assumption that the objects of C generate € enters again. It 
implies that for each pair of arrows g: D — C' and g’: D’ — C in the 
sieve S there is an epimorphic family {B — D’ xc D} in which each 
domain B is an object of C. Each element of this epimorphic family is 
thus represented by a commutative square in C of the form 


B.D 
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Composing this epimorphic family with the coequalizer (5) then yields 
a new coequalizer diagram 


B= []pD—*_—-c 
He==H (7 
in which the first coproduct is indexed by all B in the commutative 
squares of (6), while for each such square the map a sends the summand 
B to the summand D indexed by g via h: B — D, and similarly § sends 
B via k: B > D' to the summand D’ indexed by the map g’ € S of (6). 
For each object E of € the contravariant Hom-functor €(—, FE) ap- 
plied to the coequalizer (7) in € yields an equalizer 


[[€8, 2) —=— |] é, £) —— £(¢,£) 
B 


ges 


in Sets. This equalizer states exactly that the Hom-functor C 
E(C, E) on C satisfies the sheaf condition for the covering sieve S. Since 
this holds for every covering sieve of the topology J defined as in (4), it 
follows that C » &(C,E) is a J-sheaf. In particular, when £ is itself 
an object of C, one has €(C,E) = C(C,£E) for every object C since 
C C € is a full subcategory; so the functor C + €(C, E) = C(C, E) is 
the representable functor C°? — Sets given by the object F. Thus all 
representable functors are sheaves, which means that the topology J is 
subcanonical. This proves the lemma. 


Now we will investigate the unit and the counit of the Hom-tensor 
adjunction (1) for the category C and the corresponding inclusion func- 
tor A: C-+€. For any presheaf Q € Sets©”, the unit ng: Q—-> 
Hom, (A, Q ®c A) has components 


(ng)e: Q(C) —— Home(A(C), Q @c A) (8) 
for each object C € C. Here the function (jg)c sends each element 
gq € Q(C) to the map 

q®—:A(C)—> Q8c A (9) 
as defined in §VII.8(4). On the other hand, for each object E of € the 


counit ¢g: Hom,(A, E) @c A — E is the vertical map in the diagram 


I] 40C¢)— = [] A> Hom, (A, £) ec A 
ur Cl 3C Cec \ 
r: A(C)3E ri A(C) HE 
(10) 


a 
Le) 


Pp 
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Here the upper part is the coequalizer diagram defining the tensor prod- 
uct Hom,(A, E) ®c A, while p is the map on the middle coproduct whose 
components are the given arrows r: A(C’') — E£, for all objects C € C. 

We already know, by Lemma 1, that the functor Hom,(A, — ) of (1) 
sends objects E of E to J-sheaves on C. Then, restricting the adjunction 
(1) to J-sheaves, we will prove Giraud’s theorem by showing that the 
counit €g is an isomorphism for each object E € €, and that the unit ng 
is an isomorphism for each J-sheaf Q. This will mean that the restricted 
adjunction is an equivalence of categories. 


Lemma 2. The counit ¢g is an isomorphism for every object E of 
the category E. 


Proof: Given £, consider all the arrows r: D — E with domain D 
in the generating category C. Because C generates € these maps r form 
an epimorphic family. In other words, there is an epimorphism 


Dp: I] D—— E, 


Dec 
r: DoE 


(11) 


where the component of p with index r sends D into E via r. This 
coproduct is indeed indexed by a small set, since by assumption C is 
small and € has small Hom-sets. Notice that the map p in (11) is the 
same as the map p in (10), because A was defined to be the inclusion 
functor C — € [except that we have changed the notation from C' in (10) 
to D in (11)]. By exactly the same argument as in the proof of (7) for 
Lemma 1 (but with C’ there replaced by £) it follows that the present 
epimorphism p fits into a coequalizer diagram of the form 


(12) 


where the first coproduct ranges over all the commutative squares i in 
E of the form 


B—*t—-.D 


a: | I (13) 


D' —_— E, 


where the objects B, D, D’ are in C, while the maps a and of (12) 
are defined on the summand indexed by such a square by using the 
maps h and k, respectively. We claim that, for any object X of E, an 
arrow f: [[p,,D — X coequalizes a and G in (12) iff it coequalizes 6 
and 7 in (10). This claim, once verified, will show that a, @, and 0, 7 
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have isomorphic coequalizers, so that the map eg of (10) is indeed an 
isomorphism, as stated in the lemma. 

To check this claim, take such an arrow f with its components 
fp,7: D- X for all D€ C and allr: D> Ein E. Now foa=foB 
means that fp,,oh = fp,,, ok for each commutative square of the form 
(13). On the other hand, f 06 = f o7 means that fp,ou= foru 
holds for every arrow u: D’ + D in C and every r: D —-» E. Therefore 
f@ = fr implies for every square (13) that 


fo Oh = fan = fayk = for Ok; (14) 


therefore, fo = f@. Conversely, if fa = f@, then for every u: D’ + D 
in C and every r: D — E the square 


of the general form of (13) yields fp,-ou = fp/ru. Therefore, f0 = fr. 
This completes the proof of Lemma 2. 


Lemma 3. The functor Hom,(A, — ): € —> Sh(C, J) preserves epis. 


Proof: Consider an epimorphism u: E’ — E in € and an element 
r: C —» E of Hom,(A, E)(C) = Homg(A(C), E). Because the objects 
of the category C generate €, the pullback E’ x C along u is covered 
by an epimorphic family of arrows B — E’ xz C with each domain B 
in C, as in the (pullback) diagram 


B— E' xp C ——+C 


| | (BEC). 


E’ —_.—- E 
But u is epi, hence so is the pullback E’ x~ C — C, by the stability 
assumptions (iv) on €. Therefore the family of all composites B —> 
E'x CC —» C is again an epimorphic family. Thus we have constructed a 
cover of C’ in the site (C, J) by arrows h: B — C so that each composite 
roh: B-» E is in the image of the map 


ux: E(B, E’) ——> €(B, E) 


given by composition with u: E’ — E. In the terminology of §IIL.7, 
this means that the natural transformation u,: E(—,E’) — E(—,E) 
is “locally surjective”; hence, by Corollary ITI.7.5, it is an epimorphism 
between sheaves on C. Since u: E’ — E was an arbitrary epi in €, this 
proves that the functor # ++ Hom,(A, EF) = E(A(—), E) preserves epis. 


584 Appendix: Sites for Topoi 


Lemma 4. The functor Hom,(A, —): € — Sh(C, J) preserves co- 
limits. 


Proof: Since all colimits can be constructed from coproducts and 
coequalizers, it suffices to prove that the functor Hom,(A, — ) preserves 
these. Consider first an equivalence relation R C E x E in € with its 
coequalizer 


R—— E—»+E/R. 


By the exactness assumptions on €, RF is the kernel pair of FE» E/R. 
Hence, because Hom,(A, —) is left exact, the two left-hand parallel 
arrows in (15) below form the kernel pair of the right-hand arrow: 


Hom, (A, R) ———} Hom, (A, FE) ——> Hom, (A, E/R). (15) 


But this right-hand arrow is epi by Lemma 3. All this takes place 
in the (Grothendieck) topos Sh(C, J) where we already know (by the 
first half of Giraud’s theorem) that epis are coequalizers of their kernel 
pairs. Hence (15) is indeed a coequalizer diagram. Since we now know 
that Hom,(A, — ) preserves coequalizers of equivalence relations, Corol- 
lary 2.3 shows that Lemma 4 will be proved if we show that Hom,(A, — ) 
also preserves small coproducts. 

First recall from Chapter III that for any (small) family { F,} of 
sheaves in Sh(C, J), their sheaf coproduct |], is constructed as 


|] ~ a(| | () F,), 


where W® ) denotes the coproduct in Sets©” (the pointwise disjoint 
sum), and a: Sets©’” — Sh(C, J ) is the associated sheaf functor. 

Now consider any coproduct E = |], Ea in the given category €, 
with its coproduct inclusions i,: E, — E. By applying the functor 
Hom,(A, —) to these coproduct inclusions, we obtain the map ¢ of 
presheaves, as in the top row of (16) below, and from it the associated 
map ad of sheaves. In this diagram, 7) is the unit of the associated sheaf 
adjunction; it is an isomorphism on the right since Hom,(A, ||, Ea) 
upper right is a sheaf by Lemma 1. Here is the diagram (in the presheaf 
category): 


[ | ®Home(A, Ea) ——*_, Hom, (A, [| ]£2) 


| ee ao h (16) 


| | Home(A, Ea) —za- 7 aHom, (A, | ]£.), 
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where we have written ¢ = 77! 0a(¢). We will now show that a¢ is an 
isomorphism of sheaves. 

First we prove that ag is mono. To this end, consider at each object 
C of C the component ¢¢ of ¢; since the functor A is the inclusion 
C —> €, this component is the map of sets 


oc: I] Home (C, E.) ——> Home (C, I] Ea) (17) 


induced by the given coproduct inclusions ig: E,~—~ |] Eq. Let K de- 
note the kernel pair of ¢, as a map of presheaves. In this category, limits 
such as kernel pairs are computed pointwise, so each K(C) is the kernel 
pair of ¢c in Sets. Thus, it consists of pairs of elements (a,r: C > Eq) 
and (3,8: C + Eg) with 


éc(a,r) = b0(8,8); ie, igor =igos: C | |Ea. (18) 


[So these are just the usual pairs (a,r) and (@,s) used to test whether 
¢éc is monic.| Now by the exactness hypotheses of Giraud’s theorem 
the coproducts in € are disjoint. Hence, by the second equation of (18), 
either a = @ and r = 5, or the pair (r,s) constitutes a map A(C) — 0 
since Ey x gEg is the initial object 0 of € whenever a # G. In other 
words, gc is injective except perhaps in the case when there is a map 
A(C) — 0. But the initial object 0 is the coproduct of the empty family, 
hence the stability of coproducts in € implies that any map C' — 0 
shows that C is also isomorphic to the coproduct of the empty family, 
so that C = 0. It follows that the kernel pair K of ¢—a subobject of 
(y?. ) Hom, (A, Eq))?—is mapped by the unit 7 into the diagonal, as in 


K 


(L]  Home(A, Ea)? 


ira shes ece a 


law (19) 


[ | Home (4, Fa) x (| ] Home (A, Ba))?. 


Indeed, we have just shown that K(C) is already contained in the diag- 
onal of ([], €(C, Ea))? unless C ~ 0. But in the case where C' = 0, the 
empty family is a cover of C in the chosen Grothendieck topology J on 
C. But then any sheaf on C must take C to a singleton set; in particu- 
lar this applies to the sheaf |], Hom,(A, Eq). Thus also for C 0, the 
component yo x 7c sends K(C) into the diagonal. By applying the asso- 
ciated sheaf functor to the subobject K one concludes that the resulting 
subobject ak of ([], Hom,(A, E.))” is contained in the diagonal there. 
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But the associated sheaf functor a is left exact, so takes the kernel pair 
K of ¢ to the kernel pair aK of ag. The fact that ak is contained in 
the diagonal A as pictured above means exactly that ad is mono. 

Finally, we show that the map a¢ in (16) is epi. By Corollary III.7.6 
it will be enough to prove that the map ¢ of presheaves is locally surjec- 
tive for the given topology J. To this end, consider again the component 
gc at each object C of C, as in (17). For any arrow r: C — |] Eq con- 
struct for each inclusion i, the pullback P,, 


Bie Spe ee ae, 


| - (20) 


C—— | | Ea. 


By the basic assumption that the objects B of C generate €, the indi- 
cated arrows B — P, with domain B in C form for each fixed @ an 
epimorphic family into P,. Moreover, the assumption of the theorem 
that coproducts are stable under pullback implies that C’ in (20) is the 
coproduct C' = |] Pa with coproduct inclusions j,. In particular, these 
aIrows ja: Py — C also constitute an epimorphic family in €. There- 
fore, finally, the family of all composites 


Reps a (BEC) (21) 


is an epimorphic family, thus a covering sieve on C in the Grothendieck 
topology J (defined as in (4) above). 

But the restriction of r: C — ]] Eq along any such composite (21)— 
that is, the composite rj,w: B — |] Eg—is by commutativity of (20) in 
the image of Hom,g(B, Ey.) + Home(B, || E.) and hence in the image 
of ¢p in (17). This shows that ¢ is locally surjective for the topology 
J; hence, by Corollary III.7.6, the map a@ of associated sheaves is epi. 
This completes the proof that the functor Hom,(A, —): € — Sh(C, J) 
preserves coproducts, so Lemma 4 is proved. 


The following result will now complete the proof of Giraud’s theorem. 


Lemma 5. For every sheaf R on the generating category C the unit 
nr: R — Hom, (A, R @c A) of the adjunction 1 is an isomorphism. 


Proof: The functor R: C°P — Sets can be written as a col- 
imit of representable functors y(B) = C(—,B): C°P — Sets, as in 
81.5. According to Lemma | above, these representable functors are 
all sheaves. But for each such representable functor y(B), the unit 
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Ny(B): y(B) — Home,(A,y(B) @c A) is easily seen to be an isomor- 
phism. Indeed, representables are units for the tensor product, as in 
VII.4(4). In particular, for each object B of C the map 


A(B) —~—> y(B) @c A, at 1a, (22) 


for a € A(B), is an isomorphism. Furthermore, since the functor 
A: C — €E is simply the embedding of the full subcategory C into €, 
there is an evident isomorphism of presheaves 


y(B) —— Hom, (A, A(B)). (23) 
Thus one obtains a diagram 
~*~ 7 Home (A, y(B) @c A) 


Hom, (A, A(B)) 


with the two isomorphisms of (22) and (23). By the explicit description 
of the unit 7 = nycz) in (8) [for the special case where Q is y(B)| 
this diagram is readily seen to commute, so that 7 must also be an 
isomorphism. 

To conclude the proof of the lemma, we observe that the functor 
Rr RScA preserves colimits since it is a left adjoint, while the func- 
tor Hom,(A, — ) preserves colimits by the preceding lemma. Therefore, 
so does their composite nr: R +» Homg(A, R &c A). Hence, since any 
sheaf FR is a colimit of representables and npr is an isomorphism for repre- 
sentable R, the naturality of the unit 7 yields that 7 is an isomorphism 
for any sheaf R. This completes the proof of the lemma. 

We have now shown that the adjoint pair of functors (1) restricts (by 
Lemma 1) to an adjoint pair of functors Sh(C, J) @ €, and that the unit 
and counit of this restricted adjunction are isomorphisms (Lemmas 5 
and 2). Therefore, this restricted adjunction constitutes an equivalence 
of categories Sh(C, J) = €, so that E€ is indeed a Grothendieck topos, 
thus completing the proof of Giraud’s theorem. 


4. Some Consequences of Giraud’s Theorem 


The construction of the site (C, J) in the previous section may be 
formulated thus: 


Corollary 1. If C is a small full subcategory of a Grothendieck 
topos € which generates E, while J is the (subcanonical) topology on C 
in which the covering sieves on an object C' are the epimorphic families 
to C, then E is equivalent to Sh(C, J). 
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Moreover, the equivalence is given by the functor 
E-+Sh(C,J), E+ Home(—, £). (1) 


Corollary 2. Every Grothendieck topos € has a small site which is 
subcanonical and closed under (any subset of) the following operations: 
finite limits, exponentials, countable colimits, subobjects, quotients by 
equivalence relations. 


Proof: Take a small site (C, J) for €. We may assume that C is a 
full subcategory of € and, by Lemma 3.1, that the topology is subcanon- 
ical. Then take the full subcategory D of € which is the closure of C 
under all of (or any selection of) the operations listed in the statement of 
the corollary. This closure D is still (equivalent to) a small subcategory 
of €, and D generates E since C already generates €. By Corollary 1, the 
category D, equipped with the topology given by epimorphic families, 
is a subcanonical site for €. 

Since the site for a Grothendieck topos is not uniquely determined 
by the topos, it is useful to examine the relations between different sites. 

We now formulate a comparison between sheaves on a given site and 
those on a smaller related site. If (C, J) isa site and A is a subcategory 
of C, one says that a covering sieve S on an object C of C is a “cover 
by objects from A” when every arrow C’ — C of the sieve S factors as 
C’ —+ A —- C for some object A of A. When such a sieve S exists, C 
is said to be covered by objects from A. The desired comparison may 
now be formulated as follows: 


Corollary 3 (“The Comparison Lemma”). For a subcanonical 
site (C, J), let A be a full subcategory of C for which every object of C 
has a cover by objects from A. Define a topology J’ on A by specifying 
that a sieve S on A is a J'-cover of A iff the sieve (S) which it generates 
in C is a J-cover of A. Then the restriction functor Sets©”” + Sets*”” 
induces an equivalence of categories 


Sh(C, J) —~— Sh(A, J’). 


(There are sharper versions of this comparison lemma, for which J 
need not be subcanonical and A C C need not be full; see, e.g., [Kock, 
Moerdijk, 1991].) 


Proof: Let € = Sh(C,J), and write as usual ay: C — € for the 
Yoneda embedding followed by sheafification. Recall that the topology 
J can be recovered from the category € of sheaves (Corollary III.7.7) 
by the statement that a family { f.: Ci + C'} covers C iff the induced 
family {ay(C,) — ay(C)} is an epimorphic family in the category of 
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sheaves. Moreover, since the topology J is subcanonical, the repre- 
sentable presheaves y(C’) here are already sheaves, so ay(C’) = y(C), 
for any object C' € C. Consider now the subcategory A. The hypothesis 
on coverings by objects of A implies that for each object C’ in C there is 
a set A; of objects from A such that there is a family of maps yA; — yC 
which is epimorphic in the category of sheaves. Since C generates €, so 
does A. Now take the topology J’ on A to consist of those sieves S 
which are epimorphic families in €. Then by Corollary 1 we have an 
equivalence € = Sh(A, J’). 

To see that this topology J’ is as described in the statement of the 
corollary, consider such a sieve S on an object A of A for which the map 


Lly@) — (4) 


is epi in €, where u ranges over the arrows u: B — A in S. Then by 
Corollary III.7.5, there is a J-cover T on A in the site C for which every 
v: C -» Ain T factors through some arrow u: B —> Ain S. This means 
that T is contained in the sieve (S) generated in C by the arrows in S. 
In particular, (S) is also a J-cover in C. Conversely, consider a sieve S 
on an object A of A for which (S) is a J-cover in C; this means that 
there are arrows h;: A; —> A of S and k,;: Ci; — A; such that the 
whole family h; ok; is an epimorphic family in €. It then follows that 
the h, yield an epimorphic family in €. In other words, (S) a J-cover 
implies that S is a J’-cover. It follows that the topology J’ is indeed 
as described in the statement of the corollary. Moreover, the functor 
E = Sh(C, J) > Sh(A, J") giving the equivalence of Corollary 1 is clearly 
the functor which restricts a sheaf on C to one on the subcategory A. 

The utility of this comparison lemma may now be illustrated in sev- 
eral cases: 

(a) If X is a topological space with the standard notion of open 
coverings, then any basis B C O(X) for the topology of X does satisfy 
the hypothesis of the comparison lemma, simply because any open set 
is the union of open sets of a basis. As a consequence, one obtains a 
new proof of Theorem II.1.3, describing sheaves in terms of a basis. As 
stated there, it follows that a sheaf F’ on the space X may be defined 
(uniquely up to isomorphism) by specifying the values F(B) only for the 
open sets B of the basis. When the basis B is closed under intersections 
[which are pullbacks in O(X), regarded as a category], such a functor F 
is a sheaf iff 


F(B) —— || F(8) =3 |] F(4.08B;) 


is an equalizer, for any cover of an element B of B by basis elements B; 
[as in §II1.4(5)]. 
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(b) For a natural number n, let M,, be the category of all C™- 
manifolds of dimension n, equipped with the usual open cover topology 
described in 8III.2. Since any n-manifold M is locally diffeomorphic to 
R” and since any M is covered by charts of this form, the comparison 
lemma implies that the category of sheaves on M,, is equivalent to the 
category of sheaves on the site with only one object, the Euclidean space 
R”, and with all smooth functions R” — R” as arrows, in which the 
covers are simply families { f;: R" — R”} of open embeddings which 
cover R”, in the sense that R” = ), f;(R”). Notice that a sheaf on this 
site is a set, equipped with an action by the monoid of smooth functions 
R” — R”, and satisfying a suitable sheaf condition. 

(c) Consider the double negation (or, the dense) topology on a poset 
P, as in 8III.2 example (e). The poset P is called “separative” ([Jech 
1978]) or “refined” ([Bell 1977]) if it has the following property 


(i) ¢ Z p implies that there exists some r < p such that s < r implies 
sq. 
For such a poset one may prove (much as in the special case of the Cohen 
poset treated in §VI.2) that every representable functor P(— ,p): P°? 
Sets is a sheaf; thus the topology is subcanonical. Now recall that an 
ideal U in a poset P is a subset U C P such that 


(ii) For p, gE P,p<qe€U implies pe U. 
Also an ideal U is closed when in addition 
(iii) For a set D C P dense below pe P, DCU implies p € U. 


(Recall that a subset D is said to be dense below p if for any g < p there 
exists an r € D with r <q.) Any ideal U of P is contained in a smallest 
closed ideal U, defined by 


peU iff{q|q<pandq€U} is dense below p. 


The intersection of two closed ideals is closed, while each family of closed 
ideals U; has as a supremum, the closure of the union. In fact, one easily 
shows that the closed ideals in P form a complete Boolean algebra B(P). 
When P is separative, as in (i), every principal ideal (p) = {q|q€ 
P,q < p} is closed. Hence there is an embedding 


i: P+ B(P), pr (p). (2) 


Now on the category P take the dense topology, and on the complete 
Boolean algebra B(P) of closed ideals take the topology where a cover 
is a (possibly infinite) supremum, as for cHa’s in §III.2 example (d). 
Then for any ideal U, 


U=\V/{(p)|pev}, 
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so every object of the large site B(P) of (2) is covered by images i(p) = 
(p) from the smaller site P. Moreover, any sieve U on p [that is, any 
ideal U C (p)] gives a cover V{ (gq) | ¢ € U} = (p) of i(p) in B(P) just 
when this sieve U is dense below p, that is, exactly when U covers p in 
the dense topology. Therefore, the topology on P induced by 7 is indeed 
the dense topology. The comparison lemma thus yields an equivalence 
of sheaf categories 


Sh(P) —~— Sh(B(P)). 


This states that any model of set theory constructed (as in the method 
of Cohen, Chapter VI) by sheaves on a separative poset can also be con- 
structed by sheaves on the associated complete Boolean algebra. Briefly, 
this means that forcing 4 la Cohen has the same content as Boolean- 
valued models. 

As another application of Giraud’s theorem, one obtains the follow- 
ing result which compares the Grothendieck topoi of Chapter III with 
the elementary topoi of Chapter IV. (Recall that every Grothendieck 
topos is an elementary topos.) 


Proposition 4. An elementary topos € is a Grothendieck topos iff 
E has all small coproducts and a small set of generators. 


Proof: Clearly any Grothendieck topos is an elementary topos with 
coproducts and a set of generators |cf. the easy direction (=) of Giraud’s 
theorem]. For the converse, suppose € is an elementary topos with small 
coproducts and a set of generators. We will check that € satisfies the 
conditions (i)-(v) of Giraud’s theorem. Condition (v) is satisfied by as- 
sumption. € also satisfies condition (i), since coproducts in a topos are 
disjoint (Corollary IV.10.5), and they are preserved under pullback since 
pullback functors have right adjoints (Theorem IV.7.2). Furthermore, 
in an elementary topos € every epimorphism B — A is the coequalizer 
of its kernel pair (Theorem IV.7.8), hence gives rise to an exact dia- 
gram B x, BB — A. This diagram remains exact after pulling back 
because epis in a topos are stable under pullback (Proposition IV.7.3). 
This shows that € satisfies conditions (ii) and (iv) for Giraud’s theo- 
rem. Finally, to verify condition (iii), consider any equivalence relation 
RCE-x Ein E. Since finite colimits exist in any topos (Chapter IV), 
the coequalizer E + E/R of RE exists in €. It remains to show 
that R is the kernel pair of its coequalizer HE + E/R. An easy diagram 
argument shows that if R is the kernel pair of any arrow E — D, then 
it must also be the kernel pair of its coequalizer E — E/R. Therefore, 
the following lemma completes the proof of the proposition. 


Lemma 5. In a topos €, any equivalence relation R C Ex E is the 
kernel pair of some arrow E > D. 
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Proof: Let (00,01): R— E x E be an equivalence relation, let 
Xr: Ex E 92 be its characteristic map, and let ¢: E + Q* be the 
transpose of yr. We claim that A is the kernel pair of ¢. (The reader 
may wish to check first that this is indeed the case when E€ = Sets.) 
First we show ¢0) = $01. Since R is symmetric, there is the usual 
“twist-map” +: R — R which makes the diagram 


R————— R 


wa, jo 90) 


ExE 


commute. Also recall the pullback (3) from the first section, repeated 


here as ReR—™>R 


Th ° 


R—— E. 
ao 


Since R is transitive, the map (0971,0i72): R* R > E x E factors 
through R, and we denote this factor by p: R* R — R (so Op = 
0071, 01 = 0172). Now from the pullback (3), one readily deduces by 
elementary diagram-arguments that the following two squares are also 
pullbacks: 


RxR—™=_.R Rx R—™_, 
Consra [co eran |e (4) 
ExR—ly7 EXE, ExR—WZo EXE. 


But the two subobjects of E x R appearing on the left of these diagrams 
are isomorphic, as follows from the commutativity of 
B 


Rx Rk ————F RR 


o a= (tr, Tp) 5 
(8071,72) (81 72,71) B = (rp, 71); ( ) 


ExR 


the notation on the right means that a: RxR — R*R is the unique arrow 
with 11@ = m2 and 12a = 7p, and similarly for @. It then follows readily 
that the triangle (5) with a—and also that with G-—is commutative. To 
show that Go is the identity, use 71(8a) = rp(72,7p) and hence that 


Oo(m1B)a pace O1p(Te, Tp) = O12 (To, Tp) 
= OT p = Op = Oo™1, 
01 (11 B)a = Oop(r2, TP) = Oom1 (Tr, Tp) = Oke = Om. 
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Therefore, by the pullback (3), mGa = 71, while 728a = 72 is immedi- 
ate. Thus fa = (7,72) = 1, as desired. The proof that af = 1 is dual 
to this. 

It follows that these two isomorphic subobjects of E x FR in (5) have 
identical characteristic maps. Juxtaposing both pullbacks in (4) with 
the pullback 


R——— 1 


(80,01 ] | (6) 


Exh 0, 


we find that these characteristic maps are exactly yr o (1 x Q) and 
xr 0° (1 x 01), respectively. But when these maps are equal, then so are 
their transposed maps ¢0 and ¢0,: R + 0". 

Next, to show that (09,01) is the kernel pair of ¢: E — 0, take 
any object X and any arrows f, g: X — E such that df = dg. We need 
to find an arrow h: X — R such that Ooh = f and 01h = g. Such anh 
is necessarily unique since (0),0,): R + E x E is monic. Consider the 
two pullbacks of R along 1 x f and 1 x g, as in the diagram 


ExX—Siex EX BxXx 


«| Jian | (7) 


Py eS R a Py. 
Juxtaposing each of these pullbacks with (6), taken upside down, one 
finds that the monos ay and a, have the respective characteristic maps 
xro(1x f) and xro(1 xg): Ex X +. The transpose X > 0 
of these maps are ¢ o f and go g, respectively. Hence, since df = dg 
by assumption, also yr o (1 x f) = xr° (1 x g). So Py and P, are 
isomorphic as subobjects of E x X, say by an isomorphism 0: Ps —> P, 
with a,9 = as. Now consider the map (f,1): X + Ex X. Since 
(f,f): Ax fyo(f, 1): X - Ex E factors through the diagonal A: E > 
Ex E, hence through R C Ex E by reflexivity, it follows from the left- 
hand pullback in (7) that (f,1) factors through ay, say as (f, 1) = afk. 
But then (f, g) = (1xg)o(f,1) = (Ixg)ask = (1xg)ag0k = (Oo, 01)bgOk. 
So h = ,6k is the arrow X — R with the property that Oph = f and 
Oh = g, as required. This proves the lemma, and so completes the proof 
of Proposition 4. 


Giraud’s Theorem often enables one to recognize a certain category 
as a Grothendieck topos, even in cases where an explicit description of 
a site may not be immediately available. A typical example is the case 
of equivariant sheaves. 
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Let G be a topological group acting continuously on a topological 
space X, say from the left. Write 


pi Gx xX — X, L(g.) =g-2 


for the action map. A G-space over X is a space p: KE — X over X with 
an action of G on E such that p respects this action, as in 


GxEp—_S8 


Gx X--—> x. 


A map of G-spaces over X is simply a map of spaces over X which 
respects the G-action; thus, there is a category of G-spaces over X. 
Such a G-space over X is called étale if the map p: E —» X is an étale 
map. 

Recall from Chapter II that sheaves on a space X may be identified 
with étale spaces p: E —> X over X. We define a G-equivariant sheaf 
on X to be an étale G-space over X, and write 


She(X) 


for the category of such equivariant sheaves; it is a full subcategory of 
the category of G-spaces over X. 


Proposition 6. For any continuous action of a topological group G 
on a space X, the category Shg(X) of G-equivariant sheaves on X is a 
Grothendieck topos. 


Proof: Consider the faithful forgetful functor 
U: Shg(X) — (Btale /X) = Sh(X) 


(“forget the G-action”) from G-equivariant sheaves to étale spaces over 
X. If E+ X and F -» X are étale G-spaces over X, then their product 
in the category of étale spaces over X, ie., their pullback Ex x F + X, 
has an obvious G-action which makes it into the product in the category 
Shg(X). In other words, the functor U creates products. In the same 
way, one shows that U creates all finite limits and all colimits. Conse- 
quently, Shg(X) inherits all the exactness properties from the category 
of étale spaces over X (i.e., of sheaves on X). So Shg(X) satisfies all 
the conditions of Giraud’s theorem, except perhaps the last condition 
concerning generators. 

In order to show that Shg(X) has a set of generators, consider any 
étale G-space p: E — X. Then since p: EF —+ X is étale, there is for each 
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point e € Ea section s: U — E over some open set U C X such that e 
lies in the image of s. In other words, the set of all sections s: U + E 
of p is an epimorphic family to E. For each such section s: U + E, 
let G- s(U) = {g- s(x) | x € U,g € G} be the closure of the subset 
s(U) C E under the action of G on E. This set G-s(U) is an open subset 
of E because s(U) C E is open (any section of an étale space is an open 
map), and hence so is its translation g-s(U) = {g-s(z)|r@E@U}CE 


~ 


under any homeomorphism e+> g-e: E —+ E. Hence the union G-s(U) 
of all these translates g- s(U), for all g € G, is also open. It follows that 
the restriction p|(G + s(U)) of p: E — X to this open subspace is again 
étale. Thus, we get a commutative diagram 


G+ s(U)—_—- E 
now, p 
x 
in the category of étale G-spaces over X. Since G- s(U) C E contains 
the image of s: U — E, it follows that the collection of all these G-maps 
G-s(U)—E, for all sections s on all open subsets U C X, form an 
epimorphic family in the category Shg(X). Therefore, the collection 
of all étale G-spaces of the form G- s(U) -+ X generates the category 
Shg(X). But there is only a set of such étale G-spaces, up to isomor- 
phism. For the surjection (g, 2) > g-s(x) displays G-s(U) as a quotient 
of G x U, and clearly, up to isomorphism, there is only a set of such 


quotient spaces G x U with U an open subset of X. This shows that 
Shg(X) has a set of generators. 


Notice that for the case where X is the one-point space, the category 
Shg¢(X) is the category of continuous left G-sets. The site produced for 
this category by Giraud’s theorem is exactly the site constructed in §II1.9 
(apart from the fact that we considered right actions there). 
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In this epilogue we will make some suggestions for further reading 
related to topos theory. We do not at all aim to provide an exhaustive 
description of the available literature, but only wish to mention some 
useful books and articles in each of the various directions described be- 
low. 

Background in Category Theory and Topology. In our book, 
we have assumed only a minimal acquaintance with category theory. 
Some of the authors mentioned below assume quite a bit more, so a 
reader might wish to deepen his understanding by consulting one of 
the several general texts available, such as MacLane (“CWM”, 1971), 
Pareigis (1970), Schubert (1970), or Freyd, Scedrov (1990). Various 
texts on categorical topology and on categories as used in computer sci- 
ence are not really relevant, at least for our purposes. Fibrations—or the 
essentially equivalent notion of indexed categories—occur frequently in 
topos theory. Paré and Schumacher (1978) describe indexed categories; 
Gray (1966) has an extensive description of fibrations, while Bénabou’s 
article (1985) provides some controversy as well as a good list of ref- 
erences on fibrations. An early article (1967) of Bénabou gives a good 
introduction to the useful notion of a bicategory. For closed categories 
(those with an internal hom-functor and the corresponding tensor prod- 
uct) one may consult Kelly (1982). The latter book also covers enriched 
categories (those where the hom-functor takes values in a closed cate- 
gory). For double categories, n-categories, and the newer w-categories, 
one may consult Kelly, Street (1974) or Street (1987). 

Categories arose originally in topology and have their first applica- 
tion in axiomatic homology theory, in the famous book by Eilenberg and 
Steenrod (1952). The connection with homotopy theory and simplicial 
sets is explored in Gabriel, Zisman (1967); the same source has a good 
description of categories of fractions. Two other useful introductions to 
the theory of simplicial sets are May (1967) and Lamotke (1968). Among 
general texts on topology, we mention Dold (1972), Adams (1972), and 
Massey (1991). For the history of algebraic topology, consult the com- 
prehensive book by Dieudonné (1989); for that of category theory see 
an article by Mac Lane (1988). 

Background in Sheaf Theory. Sheaf theory started in complex 
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analysis and was developed chiefiy for its use in defining the cohomology 
of suitable spaces. Chapter II has described sheaves on spaces; we have 
omitted sheaf cohomology—but with great regrets. The short book by 
Tennison (1975) may serve as a good introduction. The earlier book by 
Swan (1964) is also short, more sophisticated, and clear, while the clas- 
sical text on sheaves in topology is still Godement (1958). Another com- 
prehensive introduction is Iverson (1986). The role of sheaves in homol- 
ogy (Borel-Moore homology) is described here and in Dieudonné (1989). 
J. Gray has an extensive article (1979) on the history of sheaves. Fa- 
mous papers by Serre (1955, 1956) pioneered the introduction of sheaves 
in algebraic geometry. 

Since much of sheaf theory (and of topos theory) is concerned with 
cohomology, many of the references in this direction require some back- 
ground in homological algebra. This is the title of the famous first 
book by Cartan and Eilenberg (1956). More accessible introductions are 
Hilton, Stammbach (1971) and Rotman (1979). The earlier introduction 
by Mac Lane (1963) is more encyclopedic (for its time). Grothendieck’s 
fundamental paper (1957), usually cited as “Tohoku”, is still well worth 
reading for its exciting discovery that abelian categories and homological 
algebra apply to sheaves. 

Algebraic Geometry. In the modern treatment of algebraic ge- 
ometry (since Grothendieck) sheaves and schemes play a central role. 
There is an introduction to schemes by Mac Donald (1968), a leisurely 
presentation of algebraic geometry in Shafarevich (1977), and a more en- 
cyclopedic presentation in the text of Hartshorne (1977). Mumford’s fa- 
mous introduction (The Red Book) has finally appeared in the Springer 
Lecture Notes. There is a more categorical description of schemes as 
given in the context of algebraic groups in Demazure, Gabriel (1970), 
and in SGA3. Grothendieck topoi arose in algebraic geometry as a way 
to define cohomology theories which would be suitable to solve the fa- 
mous Weil conjectures; étale cohomology and crystalline cohomology are 
two examples of such theories. The original, systematic and exhaustive 
treatment is that given in SGA4 by Grothendieck and his school, but 
many readers may well be discouraged to face the 1600 pages of this 
three-volume work, while subsequent books by Deligne and Milne man- 
age to thoroughly hide the categorical and topos-theoretic connections. 
Artin’s early notes (1962) on Grothendieck topologies are more accessi- 
ble; Illusie (1972) is also a standard source. A more recent exposition 
of étale cohomology is given in the book (1988) by Freitag and Kiehl. 
Etale cohomology is the cohomology of the so-called étale topos associ- 
ated to a given scheme or variety. This cohomology does not function 
well for p-torsion abelian sheaves in case the prime p is also the char- 
acteristic of the variety; for recent attempts to deal with this case see, 
e.g., Ogus (1990). Among the Grothendieck topologies other than the 
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Zariski, étale, and crystalline ones, we mention the one recently intro- 
duced by Nisnevich (1989). The literature on nonabelian cohomology is 
also considerable; for the recent state of affairs as well as many references 
one may consult Breen (1990). 

General Reading on Elementary Topoi. A standard reference 
is Johnstone (1977); it contains most of the material on elementary topoi 
known at the time of its publication. The presentation is terse and re- 
quires some sophisticated category theory—it is not a book to read on 
the beach. Inevitably it does not include some of the more recent de- 
velopments, and so does not present the use of locales which now play a 
central role in topos theory. A more recent introduction to topos theory 
is the book by Barr and Wells (1985), which is chiefiy written from the 
viewpoint of categories, rather than that of logic or geometry. It starts 
with a nice introduction to category theory, and an excellent descrip- 
tion of monads (there called triples) and their algebras. Ehresmann’s 
“sketches” are also discussed, and good use is made of representation 
theorems, in the style pioneered by Freyd in his (1972), still well worth 
reading. The recent Freyd, Scedrov book (1990) contains many use- 
ful insights connected with topoi and with categories of relations. The 
rapid presentation is original in form and content. Older references are 
the good survey paper by Wraith (1975), and the earlier lecture notes by 
Kock and Wraith (1971), still available from Aarhus University. Only 
cognoscenti will be able to get at influential early notes—by Tierney at 
Varenna (1971), by Bénabou in his seminar (1970), and the legendary 
Perugia-notes of Lawvere (1973). By all means look at the first presenta- 
tion of elementary topoi in Lawvere’s paper at the 1970 Nice Congress. 

Among the many recent developments, we will mention literature in 
several directions in the paragraphs below. 

Topoi and Mathematical Logic. An elementary topos can be 
viewed as a model of some intuitionistic version of higher-order logic. 
This aspect is discussed in Boileau, Joyal (1981) and extensively in the 
book by Lambek and Scott (1986). The latter presentation is a little on 
the formal side, but it contains several nice applications of topos theory 
to the proof theory of intuitionistic higher-order logic. There is also a 
careful discussion of the intimate relation between cartesian closed cate- 
gories and the typed lambda-calculus, based on the observation that the 
adjunction between product and exponential is essentially an application 
of the lambda operator. 

A more recent book by J. L. Bell (1988) provides a systematic pre- 
sentation of topos theory from the point of formal logic. Thus, Bell 
introduces a version of the Mitchell-Bénabou language very early, and 
then proceeds to prove the standard facts about an elementary topos 
(our Chapter IV) in a strictly formal style, with most inferences explic- 
itly exhibited, much as in the Gentzen calculus. This should be attrac- 
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tive to those readers comfortable with such styles. Bell’s last chapter 
discusses some of Lawvere’s insights concerning the philosophy of our 
subject. 

In the direction of foundations we quote first of all Lawvere’s 1964 
article “An elementary theory of the category of sets” which proposed 
a new and strictly categorical foundations for mathematics. The pro- 
posed axiomatics now takes the form of the axioms for a well-pointed 
topos, as presented in our Chapter VI. The relation of these axioms to 
those for (weak) Zermelo set theory was explored in Cole (1973) and 
Mitchell (1972) using trees as in our Chapter VI. Further foundational 
aspects are discussed in Mac Lane (1986) and in Mathias (1987); much 
remains to be clarified and extended. 

Cohen’s (1963) use of forcing for independence proofs in set the- 
ory is intimately related to sheaf theory, as first observed by Law- 
vere and Tierney; see Tierney (1972). For the background in forcing 
there are now many texts on set theory available, of which we men- 
tion Jech (1978) and Kunen (1980), and, for Boolean valued models, 
Bell (1977). An earlier reference by Fitting (1969) also discusses the 
connections between forcing and Kripke semantics. Fourman’s paper 
(1980) discusses the relation between sheaves and forcing, and gives a 
construction in any Grothendieck topos of (an intuitionistic version of) 
the standard set-theoretical hierarchy. This construction is also used by 
Freyd (1980) in his beautiful proof of the independence of the axiom of 
choice (presented in our Chapter VI). An exposition of Freyd’s methods 
as well as a comparison to standard set-theoretical approaches appears 
in Blass, Scedrov (1989); Solovay, unpublished, has done related studies. 
M. Bunge (1974) describes the proof of the independence of the Souslin 
conjecture in topos-theoretic terms. 

In addition to the general connection with intuitionistic logic [as in 
Lambek, Scott (1986)] there are also applications of topos theory to spe- 
cific questions of consistency and independence in intuitionistic analysis, 
such as our discussion in Chapter VI of Brouwer’s theorem on contin- 
uous functions. An exposition of some results in this direction can be 
obtained from Fourman, Hyland (1979); for a more elementary and ex- 
tensive exposition, one may consult Chapters 14 and 15 of Troelstra, 
Van Dalen (1988). The relation between topos theory and Kleene re- 
cursive realizability is discussed in Hyland (1982), where the “effective 
topos” is introduced. 

Another connection between topoi and logic is that between classify- 
ing topoi and geometric theories (our Chapter X). Early sources related 
to classifying topoi are Hakim (1972) and Tierney (1976). The mono- 
graph by Makkai and Reyes (1977) gives an extensive presentation of the 
theory of classifying topoi, and the relation between a geometric theory 
and its category of models. 
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Locales and Topoi. As we have noted, locales (“pointless spaces” ) 
play a central role in topos theory. An introduction to locales is pro- 
vided by Johnstone’s book (1982), and by Joyal and Tierney in the first 
part of their (1984) paper, referred to as JT. But beware: what we and 
Johnstone call a frame and a locale is in JT called a locale and a space, 
respectively. The main result of JT is that every Grothendieck topos is 
equivalent to a topos Shg(X) of equivariant sheaves, as in our Appendix, 
§4, except that X isa locale rather than a topological space, and G is not 
a group acting on X, but (more generally) a groupoid in the category of 
locales, with X as locale of objects. Such a groupoid is also called a con- 
tinuous groupoid. This result is strengthened in Moerdijk (1988), where 
it is shown that topoi can be obtained by a calculus of fractions from 
continuous groupoids, and in Joyal, Moerdijk (1990), where it is shown 
that it suffices to consider continuous groupoids consisting of homotopy 
classes of paths (much like the fundamental groupoid of a space), and 
where cohomological aspects of this representation are considered. 

Geometric Morphisms. Special properties of geometric mor- 
phisms have been examined in a number of cases: (a) open geo- 
metric morphisms (also briefly discussed in our Chapter IX); (b) lo- 
cally connected (or “molecular”) morphisms—generalizing those maps 
of spaces in which the fibers are locally connected; (c) atomic geomet- 
ric morphisms—those whose inverse image functor is a logical functor; 
(d) localic morphisms—a relative version of localic topoi; (e) local ge- 
ometric morphisms—generalizing the spectrum of a local ring. Useful 
references in this direction are: for (a), Johnstone (1980) and JT; for 
(b), Barr, Paré (1980) or the appendix of Moerdijk (1986); for (c), Barr, 
Diaconescu (1980) and JT; for (d), JT and Johnstone (1981); for (e), 
Johnstone, Moerdijk (1989). 

The category of Grothendieck topoi and geometric morphisms be- 
tween them has all small limits and colimits (in the appropriate 2- 
categorical sense), but their constructions are quite involved: pullbacks 
are described in Giraud (1972) and Diaconescu (1975), while filtered 
inverse limits already occur in SGA4 and are further studied in Mo- 
erdijk (1986); the existence of all small colimits occurs in Moerdijk 
(1988), and, more systematically, in Makkai, Paré (1989). 

Topoi and Algebraic Topology. A topos, as a kind of general- 
ized space, is open to the methods of algebraic topology, for example, 
to simplicial methods. This intriguing area has not been developed sys- 
tematically, so we will just mention some suggestive sources. An early 
example is Artin, Mazur (1969), where the “étale’” homotopy groups 
net(E, p) are defined for a topos € with a base point p: Sets — €, using 
Verdier’s “hypercovers” of €. For the topos of sheaves on a “good” space 
X, these étale homotopy groups are shown to coincide with the classical 
ones. The hypercovers used here are suitable contractible simplicial ob- 
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jects in €, which Verdier used earlier to compute the cohomology of the 
topos €. A more “rigid” version of étale homotopy has been developed 
in the context of simplicial schemes by Friedlander (1982), with the aim 
of solving the Adams conjecture in classical homotopy theory by meth- 
ods from algebraic geometry (“in characteristic p”); in this context the 
papers by Quillen (1968) and Joshua (1987) are also relevant. 

Joyal and Wraith (1983) showed how the cohomology of a topos 
can be classified (in the sense of our Chapter VIII) by a suitable “Eilen- 
berg, Mac Lane” topos K-(z,n). By simplicial methods they showed that 
this topos K-(7,n) is cohomologically equivalent to the usual Eilenberg- 
Mac Lane space K'(7,n) of algebraic topology. 

The “closed model structures” of Quillen have been influential: in 
(1967) and (1969) he showed that much of homotopy theory can be 
developed on the basis of the axioms (for the “fibrations”, “cofibrations” 
and “weak equivalences”) of such a model structure. One reason that 
simplicial techniques apply well to topoi is that the simplicial objects in 
a Grothendieck topos have such a closed model structure, as shown by 
A. Joyal in an elegant, as yet unpublished, letter (1984) to Grothendieck. 
A related older paper is Brown (1973), which gives for simplicial objects 
in a sheaf topos a weaker “local” version of a Quillen model structure. 
These simplicial techniques apply also in the context of foliations of 
manifolds. Here the usual “quotient space”, with points the leaves of 
the foliation, is usually too degenerate. A. Haefliger (1958), W. T. van 
Est (1984), and many others have proposed modified such “quotients”. 
Moerdijk (1991) shows that the homotopy and cohomology groups of 
such a modified “quotient” can be realized as the corresponding groups of 
an appropriate topos of “foliation-invariant” sheaves. Homotopy theory 
of topoi is also implicit in the use of simplicial techniques in K-theory 
and in related topics; cf., e.g., Quillen (1973). Jardine’s (1986) paper 
describes in more detail the methods from the letter by Joyal mentioned 
above, and applies these in the context of Suslin’s computations for 
the K-groups of an algebraically closed field. Thomason (1985) uses 
simplicial techniques for topoi to compare algebraic and topological K- 
theory. 

Synthetic Differential Geometry (SDG). Several recent devel- 
opments have rigorously formulated the properties of infinitesimals—as 
they were once used informally in classical analysis and differential ge- 
ometry. Robinson’s Non-Standard Analysis provides such a formulation 
for invertible infinitesimals. Synthetic Differential Geometry (SDG), on 
the other hand, provides a categorical approach to both nilpotent and 
invertible infinitesimals. It was initiated by Lawvere in 1967, while the 
first topos-theoretic models were constructed by Dubuc [the best ref- 
erence is Dubuc (1981)]). The early text by Kock (1981) presented 
both a naive (i.e., axiomatic) approach and a categorical model, while 
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Lavendhomme’s book (1987) provides an extensive and elegant presen- 
tation from the naive, synthetic, point of view. Moerdijk, Reyes (1991) 
emphasize topos-theoretic models, as well as the relation to classical 
analysis and nonstandard analysis. 

These indications cover only a few of the possible lines of develop- 
ment of topos theory. Others may arise, with topoi as carriers of new 
cohomology theories, or as vehicles for the semantics of other logics, or 
as background for simplicial techniques. 
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quotient, 256ff 
filtered 
category, 385 
colimit, 96, 390 
limit, 385 
filtering 
category, 385 
equals flat, 386, 399 
functor, 386, 394, 395 


Index 


final object, 194 
finer topology, 113, 263 Ex. 2 
finite 
limit, 30 
states of knowledge, 283 
finitely presented 
algebra, 119 
ring, 438 
first order theories, 527ff 
flat, 381 
equals filtering, 386, 399 
equals left exact, 390 
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principal-, 81, 421 
G-map, 238 
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hom-tensor adjunction, 357ff, 578 

homology groups, 452 

homomorphism 
of Heyting algebras (internal), 200 
of lattices (internal), 200 
of interpretations, 530, 532 
of models, 530, 532 

hypersurface, 117 


ideal, 116 

closed-, 590 

in a poset, 590 

maximal-, 117, 344 Ex. 4, 

Ex. 6 

prime-, 117 

principal, 590 

radical-, 117 
identity morphism, 10 
image, 184, 570 


Ind 


direct-, 68, 99, 171, 348 

inverse-, 33, 99, 147, 348 

of sublocale, 503 
implication, 48, 50, 114, 146, 199 
inclusion in coproduct, 17 
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